OBJECTIVE:

The  vision  for  the  Center  for  Bio-Inspired  Nanomaterials  (CBIN)  at  Montana  State  University 
builds  upon  a  strong  scientific  foundation  and  a  proven  multidisciplinary  team  in  the 
development  of  protein  cage  architectures  as  templates  for  nanomaterials  synthesis  and 
applications  in  nanotechnology.  The  exterior  interface,  the  interior  interface,  and  the  interface 
between  protein  subunits  can  all  be  manipulated  in  the  pursuit  of  refined  nanoparticle  properties. 
The synthesis and characterization capabilities of CBIN will be developed as a foundation to
provide  nanomaterials  with  novel  optical,  electronic,  magnetic,  and  acoustic  properties.  These 
materials  will  be  investigated  as  the  basis  for  applications  in  magnetic  sensing,  biofilm 
antifouling,  and  nanocatalysis.  Our  previous  success  in  utilizing  protein  cages  as  size  and  shape 
constrained  templates  for  synthesis  of  a  wide  range  of  nanostructures  has  led  to  an  understanding 
that  protein  cage  dynamics  plays  an  important  role  in  the  overall  properties  of  the  materials.  The 
long-term  goal  is  to  use  this  knowledge  to  guide  the  development  of  a  new  generation  of  active 
and  responsive  nanomaterials.  The  directed  program  objectives  of  CBIN  are: 


•  To  use  protein  cages  and  architectures  for  the  formation  of  nano-materials  with 
applications  in  magnetic  sensing. 

•  To  use  protein  cages  and  architectures  to  create  nano-catalysts  with  high  specificity  and 
efficiency  for  hydrogen  production  and  metal  ion  remediation. 

•  To  use  the  library  of protein  cages  and  architectures  to  create  nanomaterials  for 
targeting  biofilms  for  imaging  and  anti-fouling. 


APPROACH:  The  Center  for  Bio-Inspired  Nanomaterials  (CBIN)  has  facilitated  the 
development of a campus-wide infrastructure for the training of personnel (post-docs, graduate
students,  and  undergraduates)  and  for  collaborative  multidisciplinary  research.  Through  CBIN, 
the  infrastructure  is  in  place  for  large-scale  production  and  synthesis  of  materials  and  their 
characterization.  Materials  are  produced  using  genetic  and  chemical  modification  of  protein 
cage  materials  and  controlled  chemical  synthesis  under  mild  biological  conditions.  High- 
resolution  structural  techniques  (TEM,  X-ray  crystallography,  NMR)  and  spectroscopy  to  probe 
electronic  structure  (XAS,  EPR,  ACMS)  will  be  coupled  with  functional  investigation  of  the 
materials  properties.  Lithography 

ACCOMPLISHMENTS: 

Protein  Cage  Architectures  for  Applications  in  Nanomaterials 

Protein  cage  architectures  provide  a  unique  platform  for  synthesis  of  size  and  composition 
constrained nanomaterials. We have shown that we can utilize charged interfaces in the protein to
direct  nucleation  and  crystallization  of  nanomaterials  constrained  by  a  variety  of  protein  cage 
architectures.  In  particular,  using  this  approach  we  have  shown  the  size  constrained  synthesis  of 
magnetic  materials  by  growing  nanoparticles  of  transition  metal  oxide  materials.  These 
materials  show  unusual  size  dependent  properties.  Recently  we  have  demonstrated  that  the 
multifunctional  protein  cage  platform  can  be  used  to  attach  individual  paramagnetic  centers  in  a 
site specific manner to a viral protein cage. The resulting materials
show some remarkable and unexpected magnetic behaviour. Firstly, using a genetically incorporated
peptide  for  Gd  binding  we  were  able  to  make  a  viral  capsid  showing  very  high  proton  relaxivity. 
In  contrast,  the  attachment  of  metal  chelates  to  the  viral  capsid  (at  very  high  density  -  180  per 
cage)  showed  significantly  lower  proton  relaxivity.  Our  analysis  shows  that  local  mobility  of  the 
Gd  ion  plays  a  huge  role  in  the  composite  magnetic  properties  of  the  capsid-based  material. 
Future  work  is  directed  at  utilizing  this  observation/analysis  to  make  a  new  class  of  materials 
with  directed  magnetic  properties  based  on  a  combination  of  controlled  mobility  and  metal 
ligand  interactions. 

Using  the  exterior  surface  of  the  CCMV  viral  cage  we  have  chemically  attached  ligands  that 
direct  the  cage  specifically  to  the  surface  of  a  bacterial  cell.  The  targeting  moiety  used  in  this 
case  was  an  antibody  specific  to  protein  A  expressed  on  the  surface  of  the  biofilm  forming 
bacterium, Staphylococcus aureus. As part of our efforts to produce active multifunctional
nanomaterials, a photosensitizer, a [Ru(bpy)3]2+ derivative, was also attached to the cage. This photosensitizer acts as a catalyst
for  singlet  oxygen  production  and  results  in  the  selective  killing  of  S.  aureus  only  when  coupled 
to  the  cell  specific  targeting.  The  importance  of  these  results  lies  in  our  demonstrated  ability  to 
combine  self-assembly,  targeting,  and  functional  catalytic  activity  into  a  single  nanoparticle 
platform. Future work will be directed towards targeting of other DoD relevant biofilm forming
organisms together with delivery of selective antibacterial agents.


Protein  Cage  Dynamics 

Protein  function  is  intimately  connected  to  dynamics  and  therefore  knowledge  of  the  frequency, 
range,  and  coordination  of  motion  by  supramolecular  complexes  is  critical  to  understanding 
function  and  the  development  of  bio-inspired  nanomaterials.  We  are  using  protein  cages  as  a 
paradigm  for  studying  protein  dynamics  in  supramolecular  complexes.  The  extremely  large  size 
and  icosahedral  architecture  of  virus  capsids  limit  the  use  of  many  standard  techniques  for 
studying  protein  motion.  To  overcome  these  problems,  we  employ  an  array  of  biophysical 
techniques  to  investigate  the  solution  phase  behavior  of  viruses.  Kinetic  hydrolysis,  an  approach 
being developed in the Bothner lab, is a straightforward and powerful technique for identifying
the  dynamic  regions  within  a  single  protein  or  in  the  context  of  a  multi-component  complex. 
Protein  dynamics  is  being  investigated  at  three  levels:  the  dynamics  of  the  subunit,  the  assembled 
cage  architecture,  and  the  dynamics  associated  with  higher  order  particle/particle  and 
surface/particle  interactions. 

The  function  of  viral  capsids,  and  macromolecular  complexes  in  general,  requires 
conformational  freedom.  The  work  that  has  been  accomplished  in  the  past  year  contains  a 
number  of  major  technical  advances  that  allow  us  to  understand  the  assembly  and  function  of  a 
protein cage, Hepatitis B Virus (HBV).

These  include: 

1. The first rigorous measurements of localized rates and equilibria for dynamic motion within
a  megadalton  complex. 

2.  The  biologically  important  C-terminal  region  of  HBV  capsid  protein  is  dynamic  in  both 
dimer  and  capsid  forms.  This  is  in  contrast  to  the  assignment  as  a  well-ordered  helix  in  X- 
ray  and  cryo-EM  based  models. 

3. This region is a switch that regulates capsid assembly and function, with thermodynamic
profiles  that  invert  depending  on  the  assembly  state. 

4.  Enzymatic  hydrolysis  (kinetic  hydrolysis),  which  has  to  date  been  applied  largely  to  model 
systems,  can  be  extended  to  study  large  multi-component  complexes:  this  biophysical 
technique may be the most relevant tool for studying the structure and stability of
nanomaterials. 


Hydrogen  Catalyst  Thrust: 

Nature  has  evolved  very  active  catalysts  for  H2  production  in  the  form  of  the  Fe-containing 
hydrogenases  which  can  produce  6000-9000  molecules  of  hydrogen  per  second  for  a  single 
molecule of enzyme. Light energy can be harnessed by light-activated porphyrins or
ruthenium(II) complexes and coupled to the reduction of a hydrogenase through an electron
transfer mediator such as methyl viologen. We have been able to achieve hydrogen production in
an  in-vitro  assay  containing  hydrogenase,  semiconductors  (or  molecular  light  harvesting 
antennae)  with  methyl  viologen  as  a  redox  mediator  and  simple  organic  acids  as  a  primary 
source  of  reducing  equivalents.  The  in-vitro  complex  system  serves  as  an  ideal  platform  for  the 
characterization  and  optimization  of  hydrogen  production  catalysts,  photocatalysts,  and  coupling 
mechanisms.  We  have  been  examining  in  detail  the  structural  features  of  examples  of  members 
of  both  the  [NiFe]-  and  [FeFe]-  classes  of  hydrogenase.  These  studies  are  aimed  at  1) 
understanding  the  structural  determinants  for  catalysis  which  provide  critical  information  for  the 
design  and  synthesis  of  biomimetic  catalysts  and  2)  understanding  the  structural  determinants  of 
enzyme  stability  providing  the  necessary  information  for  the  identification  and  engineering  of 
stable  hydrogenases  to  be  utilized  in  biohybrid  materials. 
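
To put the quoted turnover of 6000-9000 H2 molecules per second per enzyme molecule into the units usually reported for enzyme assays, the conversion can be sketched as below. This is only an illustrative calculation: the molar mass of 60 kDa is an assumed round number chosen for the example, not a value taken from this report.

```python
# Convert a per-molecule turnover frequency (s^-1) into a specific activity
# (micromol H2 per minute per mg enzyme).
# ASSUMPTION: the 60,000 g/mol molar mass is illustrative only.

def specific_activity(turnover_per_s: float, molar_mass_g_per_mol: float) -> float:
    """Return micromol H2 produced per minute per mg of enzyme."""
    mol_enzyme_per_mg = 1e-3 / molar_mass_g_per_mol      # 1 mg = 1e-3 g
    mol_h2_per_min_per_mol_enzyme = turnover_per_s * 60.0
    return mol_h2_per_min_per_mol_enzyme * mol_enzyme_per_mg * 1e6

ASSUMED_MOLAR_MASS = 60_000.0  # g/mol, hypothetical

for turnover in (6000.0, 9000.0):
    activity = specific_activity(turnover, ASSUMED_MOLAR_MASS)
    print(f"{turnover:.0f} s^-1 -> {activity:.0f} umol H2 / min / mg")
```

With a different molar mass the result scales inversely, so the function takes it as an explicit parameter rather than fixing it.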

1) Our [FeFe]-hydrogenase structural studies are at an advanced stage, with a number of
published structures of the enzyme from Clostridium pasteurianum, and we are currently working on the
detailed structural analysis at better than 1.4 Å resolution. The preliminary analysis of this
structure together with complementary computational studies conducted by the Szilagyi group
has provided the basis for critical examination of the composition of the unique dithiolate ligand
of  the  cluster  with  paradigm  challenging  implications.  The  suggestion  that  the  cluster  exists 
with  an  ether  linked  dithiolate  has  significant  impact  on  mechanistic  considerations  of  H  cluster 
catalyzed  reversible  hydrogen  oxidation  and  provides  the  basis  for  the  design  of  mimetic  and 
biochemical  experiments  to  examine  ligand  assignments  directly. 

2)  Another  avenue  of  our  work  has  focused  on  using  the  high  resolution  structure  as  well  as 
genomic  analysis  of  the  variances  in  amino  acid  sequence  of  hydrogenases  from  different 
microbial  sources  as  a  platform  for  identifying  the  structural  determinants  for  hydrogenase 
stability  and  identifying  appropriate  model  systems  for  use  in  materials  synthesis.  We  have 
compared, through homology modeling, [FeFe]-hydrogenases from both mesophilic and
thermophilic organisms, as well as from strict and facultative anaerobes, to develop hypotheses for enzyme
thermal  adaptation  and  oxygen  tolerance.  In  addition,  we  have  worked  from  the  observations 
made  by  Posewitz,  King,  and  coworkers  in  which  it  was  shown  that  [FeFe]-hydrogenases  can  be 
successfully  expressed  in  E.  coli  when  the  structural  gene  products  for  the  enzyme  are  expressed 
in  a  background  in  which  the  accessory  enzymes  HydE,  HydG,  and  HydF  are  coexpressed.  We 
have been using this expression system to examine the heterologous expression of active
hydrogenases  from  a  variety  of  microbial  sources.  To  date,  we  have  examined  [FeFe]- 
hydrogenases  from  five  different  thermophile  sources.  We  have  constructed  the  appropriate 
clones  to  heterologously  express  a  number  of  hydrogenases  from  thermophiles  and 
hyperthermophiles. Unfortunately, to date we have been unsuccessful in the heterologous
expression of thermophile [FeFe]-hydrogenases with detectable hydrogenase activity, an interesting
observation since the system has worked well for a number of hydrogenases from mesophiles.
These observations have motivated us to revisit the amino acid sequence and genomic context of
thermophile [FeFe]-hydrogenases as well as the overall metabolism of thermophiles possessing
these [FeFe]-hydrogenases. The results of this analysis suggest that the [FeFe]-hydrogenases in
these organisms may be functioning physiologically as hydrogen oxidation catalysts.
In addition, amino acid sequence differences are noted at key positions in the
enzyme active sites of thermophile [FeFe]-hydrogenases when compared to their mesophile
counterparts. We are currently moving forward with the working hypothesis that the subtle
differences allow the [FeFe]-hydrogenases to function more effectively in hydrogen oxidation in
these  thermophile  hosts  and  we  are  testing  this  hypothesis  by  amino  acid  substitution 
experiments. 


Magnetic Nanoparticle Thrust:

The first major accomplishment for this period, reported in the Journal of Applied Physics, is
the  extension  of  our  work  in  the  application  of  the  first  principles  understanding  of  magnetic 
damping  for  itinerant  ferromagnets.  We  have  shown  that  there  is  a  strong  correlation  between  the 
density  of  states  and  the  damping  rate  and  have  shown  that  two  mechanisms  are  responsible  for 
the  temperature  dependence  of  the  damping,  one  an  inter-band  and  the  second  an  intra-band 
excitation. We can now calculate, from first principles, the damping rate as the electronic
bandstructure  is  modified  through  doping  or  strain,  allowing  for  a  predictive  tool  for  engineered 
damping  structures. 

A  second  major  accomplishment,  submitted  to  the  Journal  of  Magnetism  and  Magnetic  Materials, 
is  our  development  of  angle  dependent  Electron  Magnetic  Resonance  measurements  of  different 
sized  Fe-oxide  nanoparticles  synthesized  inside  different  sized  protein  cage  structures.  By 
cooling  a  suspension  of  encapsulated  magnetic  nanoparticles  in  an  applied  field  below  the 
solution  freezing  temperature,  we  are  able  to  show  that  the  nanoparticles  are  locked  to  the  protein 
cage  structure  and  that  the  surface  anisotropy  term  is  large.  As  the  particle  size  is  decreased,  the 
increase  in  the  surface-to-volume  ratio  allows  us  to  separate  the  contribution  of  the  surface 
anisotropy  from  the  bulk  anisotropy.  This  has  important  implications  in  using  magnetic 
nanoparticles  as  elements  of  a  sensor  technology. 
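
The size argument above can be illustrated with a short sketch. It assumes spherical particles and the commonly used phenomenological relation K_eff = K_V + (6/d) K_S, which is a textbook model and not necessarily the analysis used for the measurements reported here. All numerical values in the example are hypothetical.

```python
# Separate surface and bulk anisotropy from the size dependence of K_eff.
# ASSUMPTIONS: spherical particles; K_eff = K_V + (6/d) * K_S;
# the numbers below are hypothetical, chosen only to exercise the code.

def surface_to_volume(diameter: float) -> float:
    """Surface-to-volume ratio of a sphere of diameter d (units of 1/d)."""
    return 6.0 / diameter

def effective_anisotropy(k_volume: float, k_surface: float, diameter_m: float) -> float:
    """Effective anisotropy (J/m^3) from bulk (J/m^3) and surface (J/m^2) terms."""
    return k_volume + surface_to_volume(diameter_m) * k_surface

def separate_contributions(d1, k_eff1, d2, k_eff2):
    """Solve the two-size linear system for (K_V, K_S)."""
    slope = (k_eff1 - k_eff2) / (1.0 / d1 - 1.0 / d2)  # equals 6 * K_S
    k_surface = slope / 6.0
    k_volume = k_eff1 - slope / d1
    return k_volume, k_surface

K_V, K_S = 1.0e4, 2.0e-5          # hypothetical bulk and surface terms
sizes = (6e-9, 12e-9)             # two particle diameters in metres
measured = [effective_anisotropy(K_V, K_S, d) for d in sizes]
print(separate_contributions(sizes[0], measured[0], sizes[1], measured[1]))
```

Because K_eff is linear in 1/d under this model, measurements at two or more particle sizes are enough to recover the bulk term as the intercept and the surface term from the slope.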

A  third  major  accomplishment,  not  submitted  yet,  is  the  identification  of  the  doping  site  for  Mn 
and  Zn  doping  (in  a  range  of  0-33%)  of  Fe-oxide  magnetic  nanoparticles.  By  using  X-ray 
absorption  spectroscopy  and  magnetic  circular  dichroism,  we  find  that  both  the  Zn  and  the  Mn 
substitute into an octahedral site with a +2 valence state. Because of this substitution, the
magnetic moment is controllable from 32 μB per unit cell down to zero. This demonstrates that
the  magnetic  moment  of  these  particles,  as  well  as  the  anisotropy  values,  are  controllable  through 
doping. 
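
The end points quoted above (32 μB per unit cell without doping, zero at 33% doping) can be reproduced for the Zn case with a simple collinear two-sublattice picture of an inverse spinel. The sketch below is an idealized model, not the analysis behind the reported measurements. It assumes spin-only moments (Fe3+ = 5 μB, Fe2+ = 4 μB, Zn2+ = 0 μB), antiparallel tetrahedral and octahedral sublattices, 8 formula units per unit cell, and that each Zn2+ replaces an octahedral Fe2+. It does not attempt to describe the Mn case.

```python
# Idealized collinear (Neel-type) model of the net moment of Zn-doped Fe3O4.
# ASSUMPTIONS: spin-only moments, Zn2+ replaces octahedral Fe2+ only,
# 8 formula units per unit cell. Illustrative sketch only.

MU_FE3 = 5.0   # Bohr magnetons
MU_FE2 = 4.0
MU_ZN2 = 0.0
FORMULA_UNITS_PER_CELL = 8

def moment_per_unit_cell(zn_fraction_of_metal_sites: float) -> float:
    """Net moment (Bohr magnetons per unit cell) for a Zn fraction from 0 to 1/3."""
    if not 0.0 <= zn_fraction_of_metal_sites <= 1.0 / 3.0:
        raise ValueError("model only covers Zn fractions between 0 and 1/3")
    # One of the three metal sites per formula unit holds the divalent ion.
    replaced = 3.0 * zn_fraction_of_metal_sites
    octahedral = MU_FE3 + (1.0 - replaced) * MU_FE2 + replaced * MU_ZN2
    tetrahedral = MU_FE3
    return FORMULA_UNITS_PER_CELL * (octahedral - tetrahedral)

for x in (0.0, 1.0 / 6.0, 1.0 / 3.0):
    print(f"Zn fraction {x:.3f}: {moment_per_unit_cell(x):.1f} Bohr magnetons per cell")
```

In this picture the two Fe3+ sublattices cancel, so the net moment tracks the remaining octahedral Fe2+ and falls linearly to zero once every divalent site is occupied by Zn2+.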

We have developed new methods to characterize the cage structures and the mineralized cores,
both  while  encapsulated  and  after  protein  removal,  for  the  successful  synthesis  of  uniform 
magnetic  nanoparticles  of  controllable  shape,  size,  composition,  and  magnetic  properties.  We 
have  successfully  generated  microscale  patterns  of  antibody  and  protein  cage  on  silicon  and  gold 
substrates  using  both  microplotter  and  FIB-based  patterning  technologies.  The  results  are  as 
follows: 

Patterning  by  microplotter  was  achieved  by  depositing  a  preselected  antibody  solution  directly 
onto  chemically  activated  silicon  or  gold  surfaces  after  modifying  these  surfaces  with  maleimido- 
or  NHS-  groups.  The  spot  size  achieved  by  micro-plotter  is  a  function  of  many  parameters, 
including  the  tip  diameter  of  the  micro-plotter,  the  surface  hydrophobicity,  and  the  optimization 
of  experimental  parameters  such  as  the  quality  of  surface  modification,  concentration  of  proteins, 
plotting  speed,  and  voltage  applied  to  the  driving  piezo.  Using  an  optimized  microplotter, 
microarrays of bacterial cells with a spot size of 25 μm have been achieved on maleimido-
modified silicon surfaces, an example of which is shown in Fig. 1a below.

FIB-based  patterning  was  achieved  using  a  focused  Ga+  beam  to  create  patterns  on  a  silicon  or 
gold  substrate  that  was  pre-passivated  with  molecules  that  resist  the  nonspecific  binding  of 
proteins. We have successfully achieved a lateral resolution of ~1 μm and can prepare patterns
with sizes anywhere from 1 μm to 200 μm. We applied this approach to the immobilization of
Sulfolobus turreted icosahedral virus (STIV), a cage-shaped protein with a diameter of ~50 nm,
on  pre-modified  silicon  substrates.  Although  the  attachment  of  STIV  cages  to  silicon  was 
observed,  we  observed  low  coverage  density  of  STIV  on  the  surface,  presumably  due  to  the  poor 
purity  of  the  STIV  cages  used.  We  believe  this  is  because  the  STIV  sample  used  in  our 
experiments  most  likely  contained  not  only  the  protein  cages  but  also  other  small  proteins  that 
competed  with  the  attachment  of  STIV  to  the  substrate  surface.  This  competitive  binding  reduced 
the  density  of  STIV  cages  on  the  surface.  We  expect  to  improve  the  density  by  using  high  purity 
protein cages (e.g., tobacco mosaic virus (TMV)). This work is under active consideration, time
permitting.


Computational  Thrust: 

A  systematic  data  collection  was  carried  out  at  the  Stanford  Synchrotron  Radiation  Laboratory 
for  commercially  available,  freshly  prepared  bulk,  and  protein-cage  assisted  synthetic  iron  and 
molybdenum oxide and sulfide samples. Qualitative data comparison further supports earlier
observations  by  CBIN  members  that  protein  cages  allow  for  accessing  areas  of  iron  oxide/sulfide 
phase  diagrams  that  would  not  be  possible  using  conventional  synthetic  methods.  Using  the 
complementarity  of  the  X-ray  absorption  data  collected  at  the  iron  K-edge  (EXAFS),  iron  L-,  and 
ligand K-edges (XANES), we began to quantitatively determine the structural differences among
various  species. 

With respect to our computational work, we joined efforts with the Peters group on mapping the
soft/hard  material  interface  of  iron-sulfur  clusters  embedded  into  a  protein  matrix.  In  this 
collaboration,  we  employed  computational  chemical  tools  and  approaches  developed  by  us  to 
probe  a  key  question  related  to  the  active  site  of  biological  hydrogen  production.  We  carried  out 
an  unbiased  investigation  of  the  chemical  composition  of  the  catalytically  active  iron-sulfur 
cluster of FeFe-hydrogenase and found indications for a new chemical composition that has not
been critically explored experimentally. The CBIN funding was essential to achieving this,
allowing the establishment of a virtual chemical model of the active site that captures the most
important  interactions  between  the  iron-sulfur  active  site  and  protein  environment. 


SIGNIFICANCE: CBIN has facilitated the collaborative interaction between biologists,
biochemists,  chemists,  and  physicists  focused  on  the  use  and  understanding  of  constrained 
reaction environments provided by viruses and other protein cage architectures. This bio-inspired
approach utilizes and exploits hard-soft interfaces, particle-surface interactions and
controlled  hierarchical  assembly  for  the  synthesis  of  functional  nanomaterials.  The  underlying 
design  principles  are  based  on  an  understanding  of  biological  systems.  In  addition,  we  are  also 
using  biological  materials  as  templates  for  the  construction  of  materials  with  superior  properties. 
Thus,  control  over  material  synthesis  results  in  magnetic  materials  with  unique  properties, 
catalytic  materials  with  higher  efficiency,  and  materials  that  can  be  used  to  both  integrate  and  kill 
bacterial  biofilms,  superior  to  those  produced  by  other  methods.  We  have  established  a 
productive team that spans many disciplines and is able to train the next generation of scientists
and  respond  to  the  materials  needs  of  the  nation.  The  overall  significance  of  this  work  lies  in  the 
combination  of  fundamental  scientific  understanding  coupled  to  directed  applications  of  national 
importance. 

The selectivity of antimicrobial photodynamic therapy (PDT) can be enhanced by coupling the photosensitizer (PS)
to a targeting ligand. Nanoplatforms provide a medium for designing delivery vehicles that incorporate both functional
attributes. We report here the photodynamic inactivation of a pathogenic bacterium, Staphylococcus aureus, using
targeted  nanoplatforms  conjugated  to  a  photosensitizer  (PS).  Both  electrostatic  and  complementary  biological  interactions 
were  used  to  mediate  targeting.  Genetic  constructs  of  a  protein  cage  architecture  allowed  site-specific  chemical 
functionalization  with  the  PS  and  facilitated  dual  functionalization  with  the  PS  and  the  targeting  ligand.  These  results 
demonstrate  that  protein  cage  architectures  can  serve  as  versatile  templates  for  engineering  nanoplatforms  for  targeted 
antimicrobial  PDT. 


Introduction 

The  limitations  of  conventional  antimicrobial  therapy  are  being 
exposed  by  the  manifestation  of  two  forms  of  microbial 
resistance: acquired antibiotic resistance exhibited at the single-cell
level and the intrinsic resistance of microbial biofilm
communities.1,2 A promising alternative, photodynamic therapy
(PDT),  relies  on  photosensitizers  (PS)  which  when  activated  by 
light  produce  reactive  oxygen  species  (ROS).  ROS  react  with 
accessible  cell  components,  subverting  their  function  and  finally 
causing cell death.3,4 Bacterial strains that have acquired resistance
to  conventional  antimicrobials  are  susceptible  to  antimicrobial 
PDT.5-7  In  addition,  antimicrobial  PDT  has  been  successfully 
used  to  control  biofilms8-10  and  forms  the  basis  for  emerging 
adjuvant  and  alternative  treatments  for  biofilm  infections  of  the 
oral  cavity.11 

Collateral  damage  to  host  tissue  is  a  substantial  concern  for 
all  forms  of  PDT,  and  the  capability  to  more  specifically  target 
microbes  with  PS  would  significantly  expand  the  range  of  possible 
clinical  applications.12  Nanoplatforms  provide  a  relatively  large 
surface  area  that  can  be  used  to  engineer  the  presentation  of  both 


* To whom correspondence should be addressed. E-mail: myoung@montana.edu (M.Y.); tdouglas@chemistry.montana.edu (T.D.).

† Department of Microbiology.

‡ Center for Bio-Inspired Nanomaterials.

§ Department of Chemistry & Biochemistry.

‖ Department of Plant Sciences.

(1) Costerton, J. W.; Stewart, P. S.; Greenberg, E. P. Science 1999, 284, 1318-1322.

(2) Walsh, C. Nature 2000, 406, 775-781.

(3) Brown, S. B.; Brown, E. A.; Walker, I. Lancet Oncol. 2004, 5, 497-508.

(4) Hamblin, M. R.; Hasan, T. Photochem. Photobiol. Sci. 2004, 3, 436-450.

(5) Szpakowska, M.; Lasocki, K.; Grzybowski, J.; Graczyk, A. Pharmacol. Res. 2001, 44, 243-246.

(6) Wainwright, M.; Phoenix, D. A.; Gaskell, M.; Marshall, B. J. Antimicrob. Chemother. 1999, 44, 823-825.

(7) Wilson, M.; Yianni, C. J. Med. Microbiol. 1995, 42, 62-66.

(8) Wainwright, M.; Crossley, K. B. Int. Biodeterior. Biodegrad. 2004, 53, 119-126.

(9) Wilson, M. Photochem. Photobiol. Sci. 2004, 3, 412-418.

(10) Zanin, I. C. J.; Lobo, M. M.; Rodrigues, L. K. A.; Pimenta, L. A. F.; Hofling, J. F.; Goncalves, R. B. Eur. J. Oral Sci. 2006, 114, 64-69.

(11) Meisel, P.; Kocher, T. J. Photochem. Photobiol. B-Biol. 2005, 79, 159-170.

(12) Demidova, T. N.; Hamblin, M. R. Int. J. Immunopathol. Pharmacol. 2004, 17, 245-254.


the  PS  and  targeting  ligand  to  combine  cell  selectivity  with 
photodynamic killing.13 Self-assembling protein cage architectures
offer an exceptionally versatile template for design of
multifunctional nanoplatforms that optimize functional group
presentation, and thus are ideally suited as delivery vehicles for
selective PDT.14-23 The unique subunit structure of protein cages
enables  genetic  insertion  of  chemically  reactive  amino  acids  or 
biologically  reactive  peptides  into  the  monomeric  protein  subunits. 
These  are  subsequently  presented  symmetrically  over  the  entire 
outer (or inner) surface of the assembled protein cage.24,25 The
ability  to  use  a  combination  of  genetic  and  chemical  modifications 
to  engineer  protein  cages  expands  the  possibilities  for  design  of 
multifunctional  nanoplatforms  that  can  be  used  for  selective 
targeting  of  PS.  In  addition,  their  nonspecific  adsorption  to 
mammalian cells is minimal,25,26 probably due to their intrinsic
negative  surface  charge  at  physiological  pH. 

Staphylococcus  aureus  is  a  prominent  player  in  infections  that 
are difficult to treat due to acquired resistance to antibiotics27,28
and is also one of the primary biofilm-forming microbes involved
in  persistent  biofilm  infections.1  We  previously  demonstrated 
that  the  viral  protein  cage  (cowpea  chlorotic  mottle  virus:  CCMV) 
could  be  used  to  target  S.  aureus  cells  with  MRI  contrast  agent.29 
CCMV  is  a  28  nm  diameter  icosahedral  plant  virus30  that  has 
served  as  a  template  for  implementing  several  novel  approaches 
for spatial control of multiligand presentation.31,32 Here, we show
that  a  genetically  modified  CCMV  nanoplatform  functionalized 
with  PS  can  be  targeted  to  S.  aureus  cells  using  both  electrostatic 
and  complementary  biological  interactions  and  that  the  cell- 
bound  PS  functionalized  protein  cages  induce  light-activated 
killing  under  standard  light  fluence  conditions. 

Experimental  Section 

Genetic  Modification  of  CCMV/K42R  and  Production  of  the 
Constructs  in  Plants.  The  salt  stable  mutant  of  cowpea  chlorotic 
mottle virus in which lysine 42 was replaced by an arginine (CCMV/
K42R)  has  been  described  previously.30  With  use  of  this  construct 
as  starting  material,  polymerase  chain  reaction  based  site-directed 
mutagenesis  was  used  to  produce  S102C/K42R  and  S130C/K42R, 
in  which  serines  (102  and  130,  respectively)  were  replaced  by 
cysteines. Complementary oligonucleotide primers were
5' GTTGCTTCCCAGTGTTTGTGGTACCGTGAAATCCTGTG 3' (S102C/K42R, plus strand),
5' CACAGGATTTCACGGTACCACAAACACTGGGAAGCAAC 3' (S102C/K42R, minus strand);
5' GCTGTGGCCGACAATTGCAAAGATGTTGTCGC 3' (S130C/K42R, plus strand) and
5' GCGACAACATCTTTGCAATTGTCGGCCACAGC 3' (S130C/K42R, minus strand).
Plasmids containing the mutated
sequences were first screened by digestion with unique restriction
enzymes (KpnI for S102C/K42R and MfeI for S130C/K42R). The
mutations in the CCMV coat proteins of S102C/K42R and S130C/
K42R were confirmed by DNA sequencing. Constructs were
produced  in  cowpea  plants  and  the  virus  was  isolated  as  previously 
described.33  Purity  was  verified  using  size  exclusion  chromatography 
(SEC)  (Superose  6,  Amersham  Biosciences)  and  dynamic  light 
scattering  (DLS).34 
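
As a quick consistency check on the primer pairs listed above, the following sketch verifies that each minus-strand primer is the reverse complement of its plus-strand partner and that each plus-strand primer contains the recognition site of the screening enzyme. The recognition sequences used (GGTACC for KpnI, CAATTG for MfeI) are the standard ones and are not stated in the text. The dictionary layout and function name are illustrative choices.

```python
# Check primer complementarity and the presence of the screening restriction site.
# ASSUMPTION: standard recognition sequences, KpnI = GGTACC and MfeI = CAATTG.

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]

PRIMERS = {
    "S102C/K42R": ("GTTGCTTCCCAGTGTTTGTGGTACCGTGAAATCCTGTG",
                   "CACAGGATTTCACGGTACCACAAACACTGGGAAGCAAC"),
    "S130C/K42R": ("GCTGTGGCCGACAATTGCAAAGATGTTGTCGC",
                   "GCGACAACATCTTTGCAATTGTCGGCCACAGC"),
}
SCREENING_SITES = {"S102C/K42R": ("KpnI", "GGTACC"),
                   "S130C/K42R": ("MfeI", "CAATTG")}

for construct, (plus, minus) in PRIMERS.items():
    enzyme, site = SCREENING_SITES[construct]
    complementary = reverse_complement(plus) == minus
    has_site = site in plus
    print(f"{construct}: complementary={complementary}, {enzyme} site present={has_site}")
```

Both recognition sites are palindromic, so finding the site on the plus strand implies it is present on the minus strand as well.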

Synthesis of Functionalized PS. 5-Iodoacetoamino-1,10-phenanthroline
(phen-IA) was synthesized by modification of a previously
published protocol.35 A solution of 1,3-dicyclohexylcarbodiimide
(5.29  g)  and  iodoacetic  acid  (4.76  g)  in  50  mL  of  dry  ethyl  acetate 
was  stirred  for  3  h  at  room  temperature.  The  resulting  solution  was 
filtered  to  remove  the  urea,  dried  by  rotary  evaporation,  and 
redissolved  in  25  mL  of  acetonitrile.  This  solution  was  added  to  25 
mL of acetonitrile containing 5-amino-1,10-phenanthroline (1.0 g)
and  stirred  overnight  at  room  temperature.  The  product  was  collected 
by  centrifugation,  washed  with  cold  5%  sodium  bicarbonate  and 
water,  and  dried  under  vacuum. 
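
The reagent masses in the phen-IA preparation can be converted to molar equivalents as a sanity check on the stoichiometry. The sketch below is a minimal calculation; the molecular formulas (C13H22N2 for 1,3-dicyclohexylcarbodiimide, C2H3IO2 for iodoacetic acid, C12H9N3 for 5-amino-1,10-phenanthroline) and the rounded atomic masses are standard values supplied here and are not given in the text.

```python
# Molar equivalents of the coupling reagents relative to 5-amino-1,10-phenanthroline.
# ASSUMPTION: molecular formulas and rounded standard atomic masses are supplied
# for illustration; only the masses in grams come from the procedure.

ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "I": 126.904}

def molar_mass(formula: dict) -> float:
    """Molar mass in g/mol from an element -> count mapping."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())

REAGENTS = {
    # name: (mass in g, formula)
    "1,3-dicyclohexylcarbodiimide": (5.29, {"C": 13, "H": 22, "N": 2}),
    "iodoacetic acid": (4.76, {"C": 2, "H": 3, "I": 1, "O": 2}),
    "5-amino-1,10-phenanthroline": (1.0, {"C": 12, "H": 9, "N": 3}),
}

def millimoles(name: str) -> float:
    mass_g, formula = REAGENTS[name]
    return 1000.0 * mass_g / molar_mass(formula)

amine_mmol = millimoles("5-amino-1,10-phenanthroline")
for name in REAGENTS:
    mmol = millimoles(name)
    print(f"{name}: {mmol:.2f} mmol, {mmol / amine_mmol:.2f} equiv")
```

Under these assumptions the carbodiimide and iodoacetic acid are used in roughly equimolar amounts, each at about a fivefold excess over the amine.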

Bis(2,2'-bipyridine)dichlororuthenium (Ru(bpy)2Cl2) was synthesized
according to literature procedures.36 Ruthenium chloride
trihydrate (RuCl3·3H2O) (7.8 g), bipyridine (9.36 g), and lithium
chloride  (8.4  g)  were  refluxed  in  dimethylformamide  (50  mL)  for 
8  h.  The  reaction  was  cooled  to  room  temperature,  250  mL  of  acetone 
was  added,  and  the  solution  was  stored  at  4  °C  overnight.  The  resultant 
product  was  filtered,  washed  with  water  and  ether,  and  dried  by 
suction. 

Ru(bpy)2phen-IA was synthesized by modification of a previously
published protocol.37 Ru(bpy)2Cl2 (0.7 g) and phen-IA (0.5 g) were
refluxed  in  50  mL  of  MeOH  for  3  h  with  stirring.  The  solution  was 
filtered  and  the  product  was  precipitated  by  the  addition  of  a 
concentrated  aqueous  solution  of  ammonium  hexafluorophosphate. 
The  orange  solid  was  collected  by  filtration  and  washed  with  cold 
water  and  ether  and  dried  in  a  desiccator. 

PS  Conjugation  to  S102C/K42R  and  S130C/K42R  To  Produce 
S102C/K42R-PS  and  S130C/K42R-PS.  Genetically  modified 
CCMV (S102C/K42R or S130C/K42R) (1.5 mg mL-1, 75 μM
monomer subunit) was dialyzed into 50 mM HEPES (pH 6.5)
overnight and then transferred into fresh dialysis buffer (200 mL).
Nitrogen was bubbled into the dialysis buffer for 1 h. A 40 μL aliquot of a
5.5 mM solution of the PS (Ru(bpy)2phen-IA) in DMSO was added
to 1.2 mL of the CCMV solution. This was stirred in the dark at
room  temperature  for  5  h  and  then  dialyzed  into  50  mM  HEPES  (pH 
6.5).  SEC  was  used  for  further  purification.  Fractions  eluting  from 
within  the  CCMV  peak,  the  position  of  which  was  predetermined 
using  unlabeled  CCMV,  were  combined.  These  were  further  purified 
by  dialysis  into  100  mM  sodium  acetate  buffer  (pH  4.8)  and 
concentrated to 1.0 mg mL-1 protein using ultrafiltration (Amicon).
Integrity of the labeled virus was confirmed by DLS and TEM as
described previously.34 UV-visible spectroscopy was used to
determine the molar ratio of PS to monomer subunit. PS-conjugated
S102C/K42R and S130C/K42R are referred to as S102C/K42R-PS
and  S130C/K42R-PS,  respectively. 
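
The reagent quantities in the conjugation step imply a modest molar excess of PS over capsid subunits. The Python sketch below works through that arithmetic from the values stated above (1.5 mg mL-1 protein, 40 µL of 5.5 mM PS, 1.2 mL of CCMV solution). The 20 kDa subunit mass is the approximate figure quoted in Results and Discussion, and the function names are illustrative only.

```python
def micromolar_from_mass(mg_per_ml: float, subunit_kda: float) -> float:
    """Convert a protein mass concentration (mg/mL) to a molar one (µM)."""
    grams_per_litre = mg_per_ml            # 1 mg/mL equals 1 g/L
    grams_per_mole = subunit_kda * 1000.0  # 1 kDa equals 1000 g/mol
    return grams_per_litre / grams_per_mole * 1e6


def molar_excess(reagent_mm: float, reagent_ul: float,
                 protein_um: float, protein_ml: float) -> float:
    """Moles of reagent added per mole of protein subunit."""
    reagent_nmol = reagent_mm * reagent_ul  # mM x µL = nmol
    protein_nmol = protein_um * protein_ml  # µM x mL = nmol
    return reagent_nmol / protein_nmol


# Assumption: subunit mass of 20 kDa (approximate value given in the text).
subunit_um = micromolar_from_mass(1.5, 20.0)
excess = molar_excess(5.5, 40.0, subunit_um, 1.2)
print(f"subunit concentration: {subunit_um:.0f} µM")
print(f"PS added per subunit: {excess:.2f}")
```

Under these assumptions the stated 75 µM subunit concentration is reproduced, and the PS is present at roughly a 2.4-fold molar excess over subunits.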

Biotinylation of S130C/K42R-PS To Produce S130C/K42R-PS-B.
S130C/K42R-PS was biotinylated by reaction with a 0.5 mM
solution of sulfosuccinimidyl-6'-(biotinamido)-6-hexanamido
hexanoate (Pierce) with 1.0 mg mL-1 of S130C/K42R-PS (50 µM
monomer subunit) in 50 mM HEPES buffer, 150 mM sodium chloride
at pH 7.1 at room temperature in the dark with stirring for 30 min.
The  reaction  was  terminated  by  exchange  into  1  mM  sodium  acetate 
buffer  at  pH  4.8  using  SEC.  Fractions  eluting  from  within  the  CCMV 
peak,  the  position  of  which  was  predetermined  using  unlabeled 
CCMV,  were  combined,  further  purified  by  dialysis  into  100  mM 
sodium acetate buffer (pH 4.8), and concentrated to 1.8 mg mL-1
CCMV using ultrafiltration (Centricon Microcon YM-10). Integrity
of the labeled virus was confirmed by DLS and TEM. Association
of the biotin functional groups with the biotinylated product
(S130C/K42R-PS-B) was initially tested by a dot blot assay using
alkaline phosphatase conjugated antibiotin antibody (A-7064,
Sigma-Aldrich Co.).

Liquid  Chromatography/Electrospray  Mass  Spectrometry. 

Liquid  chromatography/electrospray  mass  spectrometry  (LC/MS) 
was  performed  on  a  QToF  Micro  instrument  (Waters).  CCMV 
injected at 0.5-1.0 mg mL-1 (1-2 µL) was eluted from a
Thermoelectric Biobasic size exclusion column (250 × 1 mm) in
80% isopropyl alcohol and 0.1% formic acid. The virus disassembled
during the ionization process into monomers, which were analyzed by
the detector. Mass spectra of the multiply charged ions were
deconvoluted  using  instrument  software  to  produce  a  representation 
of  monomer  mass  versus  intensity. 

Cell  Culturing.  ATCC  strain  12598  (Cowan  I)  was  maintained 
at -80 °C in 20% glycerol, 2% peptone. A solid medium (nutrient
agar, Difco 0001) was streaked for single colonies. A single colony
was  used  to  inoculate  5  mL  of  a  broth  culture  (nutrient  broth,  Difco 
0003)  and  placed  on  a  shaker  at  37  °C  at  280  rpm  overnight.  Fresh 
broth  (5  mL)  was  inoculated  from  the  starter  culture  to  achieve  an 
OD600nm of 0.250 and incubated as above for 3 h. This procedure
resulted in a culture that had undergone about 3 doublings in the
fresh medium and was in exponential phase (Supporting Information,
Figure 1S). The Cowan I strain used in this work is a BSL2 pathogen.
BSL2  practices  were  followed  in  accordance  with  the  guidelines 
established  at  Montana  State  University. 

Electrostatic Targeting of S102C/K42R-PS to Cells. Cells were
cultured at 37 °C in batch in nutrient broth, harvested in exponential
phase, and resuspended in phosphate buffered saline (PBS) (10 mM
sodium phosphate, 100 mM sodium chloride, pH 7.0) to an OD600nm
of 0.2 (approximately 1.5 × 10^7 colony forming units (CFU) per
mL). To form the poly-L-lysine (PLL) interlayer, the cell suspension
was exposed to 6.25 µg mL-1 PLL (P1274, Sigma-Aldrich Co.) for
5 min. These cells were then exposed to S102C/K42R-PS (2 µM
equivalent PS concentration).

Targeting of Cells with S130C/K42R-PS-B Using Complementary
Biological Interactions. We modified previously published
protocols to react biotinylated antibody with the cells and then react
streptavidin (StAv) to the antibody-coated cells.38,39 Exponential
phase cells were pelleted at 4500 × g for 10 min and resuspended in
5 mL of PBS at an OD600nm of 0.3. The cell suspension was incubated
for 30 min on ice with a 1:100 dilution of biotinylated antiprotein
A (SpA) monoclonal antibody (anti-SpA mAb-B) (P3150 Sigma-
Aldrich Co.). This cell suspension was washed once in PBS to achieve
a 1:100 dilution of the antibody and incubated with 20 µg mL-1
StAv (Sigma-Aldrich Co.) for 30 min on ice. After two washing
steps in PBS (a 1:400 dilution of the StAv) the OD600nm of the cell
suspension was adjusted to 0.2. These cells were then exposed to
S130C/K42R-PS-B (2 µM equivalent PS concentration).

Field  Emission  Scanning  Electron  Microscopy.  PLL-coated  Si 
(100)  wafers  (Virginia  Semiconductor  Inc.,  Fredericksburg,  VA) 
were exposed to cells fixed in 3% glutaraldehyde in PBS for 1 h.
The coupons were rinsed twice in Nanopure water and dried under
a stream of nitrogen. The coupons with adsorbed cells were coated
with a thin film of iridium by exposing the sample for 15 s at 20
mA in an Emitech sputter coater. Cells were viewed with a Supra
55VP  FESEM  (Zeiss)  using  the  Inlens  detector  at  1  kV  and  3  mm 
working  distance. 

Evaluation of Photodynamic Killing. For evaluation of
photodynamic killing at the low fluence rate, cell suspensions were
distributed in wells of a 96-well microtiter plate (200 µL) immediately
after addition of S102C/K42R-PS and were exposed to a bank of
LEDs (peak emission at 470 nm) in a temperature-controlled light
box (20 °C) at a fluence rate of 0.763 mW cm-2 for 3.5 h, yielding
a total fluence of 9.6 J cm-2. For evaluation of photodynamic killing
at the more standard fluence rate (46 mW cm-2), cell suspensions
were distributed into wells formed in a Teflon block (100 µL/well)
20 min after addition of S102C/K42R-PS or S130C/K42R-PS and
illuminated by light from a mercury vapor lamp coupled to a Nikon
Eclipse E600 microscope using a B2A filter block (excitation
450-490 nm; emission >515 nm). The stage height was adjusted so that
the surface of the cell suspension was beyond the focal point of the
20× objective, 17 mm from the surface of the objective lens, to
achieve a circular 7 mm spot that covered the surface of the cell
suspension in the well. Fluence rate was determined using a
photometer. Times of illumination were 5, 10, and 20 min to achieve
total fluences of 13.8, 27.6, and 55.2 J cm-2, which are in the standard
range for antimicrobial PDT.40 Viable cell numbers were evaluated
by  enumeration  of  CFU  on  a  solid  medium  (nutrient  agar)  after 
incubation  at  37  °C  for  16  h. 
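
The total fluences quoted above follow directly from fluence rate multiplied by illumination time. As a check on the arithmetic, here is a minimal Python sketch; the function name is illustrative, and the only inputs are the rates and times stated in the text.

```python
def total_fluence(rate_mw_cm2: float, minutes: float) -> float:
    """Total fluence in J/cm^2 from a fluence rate (mW/cm^2) and a time (min)."""
    watts_per_cm2 = rate_mw_cm2 / 1000.0  # mW -> W
    seconds = minutes * 60.0              # min -> s
    return watts_per_cm2 * seconds        # W x s = J


# Low fluence rate: LED bank at 0.763 mW/cm^2 for 3.5 h.
low_rate = total_fluence(0.763, 3.5 * 60)

# Standard fluence rate: 46 mW/cm^2 for 5, 10, and 20 min.
standard = [total_fluence(46.0, t) for t in (5, 10, 20)]

print(round(low_rate, 1))                # 9.6
print([round(f, 1) for f in standard])   # [13.8, 27.6, 55.2]
```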

Optical Microscopy. The red fluorescence from the PS associated
with targeted cells was observed at 1000× (100× objective and
10× ocular) using the same microscope and filter set as used to
illuminate  cells  for  photodynamic  killing  at  standard  fluence  rates. 
The  fluorescence  emission  under  these  conditions  is  faint  and  rapidly 
photobleaches.  However,  the  red  fluorescence  from  targeted  cells 
was clearly visible. Images were acquired with an Olympus Camedia
camera  coupled  to  the  microscope. 

Results  and  Discussion 

As  a  delivery  vehicle  for  PS,  we  modified  a  genetic  construct 
(K42R)  of  CCMV  that  is  exceptionally  stable  (referred  to  as 

Figure 1. Genetically modified CCMV nanoplatform (NP) was
functionalized with the PS (Ru(bpy)2phen-IA) at surface-exposed
sites. The positions of the sulfhydryl groups of the cysteines (Cys)
added into the two genetic constructs are shown for a hexamer of
protein subunits: S102C/K42R (orange) and S130C/K42R (yellow);
the  position  of  these  sulfhydryls  on  the  pentamers  is  similar.  The 
PS  was  modified  to  promote  reaction  with  exposed  sulfhydryls  by 
adding  an  iodoacetamide  group  to  the  phenanthroline  moiety. 

CCMV/K42R).  Site-specific  mutagenesis  was  used  to  replace 
serines  with  reactive  cysteines  at  surface-exposed  sites  on  each 
of  the  180  (20  kDa)  protein  monomer  subunits  that  form  the 
assembled CCMV/K42R protein cage. These served as attachment
points for the PS (Figure 1). The PS, a ruthenium complex, was
modified to react selectively with the sulfhydryls of the surface-
exposed cysteines on these genetic constructs (S102C/K42R and
S130C/K42R) by functionalizing the phenanthroline moiety with
an iodoacetamide group. PS-conjugated constructs are referred
to as S102C/K42R-PS and S130C/K42R-PS, respectively.
UV-visible absorption spectroscopy was used to establish the
association of the PS with each construct and to determine the
molar ratio of bound PS to monomeric subunits (estimated as
0.520 for S102C/K42R-PS and 0.380 for S130C/K42R-PS) (see
Supporting  Information,  Figure  2S).  In  addition,  LC/MS  was 
used  to  confirm  covalent  linkage  of  the  PS  to  the  CCMV  constructs 
(see  Supporting  Information,  Figure  3S). 
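
The labeling ratios above can be converted into an approximate PS load per assembled cage by multiplying by the 180 subunits per capsid. The Python sketch below does this; it assumes the measured ratios are population averages that apply uniformly to every capsid, which the text does not state, and the names used are illustrative.

```python
SUBUNITS_PER_CAPSID = 180  # monomer subunits per assembled cage, as stated in the text


def ps_per_capsid(ps_per_subunit: float) -> float:
    """Average number of PS molecules carried by one assembled capsid."""
    return ps_per_subunit * SUBUNITS_PER_CAPSID


# Molar ratios of bound PS to monomer subunit estimated by UV-visible spectroscopy.
labeling = {"S102C/K42R-PS": 0.520, "S130C/K42R-PS": 0.380}

for construct, ratio in labeling.items():
    print(f"{construct}: about {ps_per_capsid(ratio):.0f} PS per capsid")
```

On this estimate S130C/K42R-PS carries about 68 PS per capsid, which is consistent with the figure of approximately 70 PS per binding event used later in the discussion.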

We  have  previously  used  both  electrostatic  and  complementary 
biological  interactions  to  direct  the  assembly  of  functionalized 
CCMV  into  organized  films.41  These  same  two  strategies  were 
used  to  target  PS-conjugated  nanoplatforms  to  the  cells  (Figure 
2a).  Cationic  polymers,  including  poly-L-lysine  (PLL),  have  been 
used to enhance the activity of PS.42,43 In general, cell walls tend

Figure 2. Targeting of PS-functionalized CCMV nanoplatforms to
S. aureus cells. (a) Targeting strategies using electrostatic (left) or
complementary  biological  interactions  (right).  Abbreviations  are 
(from  left  to  right):  photosensitizer  (PS);  nanoplatform  (NP);  poly- 
L-lysine  (PLL);  S.  aureus  cell  (Sa);  protein  A  (SpA);  anti-SpA- 
mAb-B  (Ab),  biotin  (B)  and  streptavidin  (StAv).  (b,  c,  d)  Epi- 
fluorescence  images:  (b)  Cells  targeted  with  S102C/K42R-PS  via 
electrostatic  interactions  (exposure  time,  0.5  s);  (c)  cells  targeted 
with  S130C/K42R-PS  via  complementary  biological  interactions 
(exposure  time,  0.5  s);  (d)  cells  mixed  with  S130C/K42R-PS 
(nontargeted)  (exposure  time,  1  s);  the  cells  in  (d)  appear  green  due 
to  faint  intrinsic  fluorescence;  the  red  background  originates  from 
the  PS-conjugated  nanoplatform  in  solution.  Scale  bars  in  the 
magnified inserts are 1 µm.

to  carry  a  net  negative  charge.  Thus,  we  predicted  that  electrostatic 
interactions  could  provide  a  simple  method  to  nonspecifically 
target  the  nanoplatform  to  S.  aureus.  In  addition,  PLL  has  intrinsic 
antimicrobial  properties,44  suggesting  that  an  interlayer  of  PLL 
on  the  cell  wall  would  enhance  the  activity  of  a  PS-functionalized 
nanoplatform.  Whereas  electrostatic  interactions  can  be  used  to 
mediate  relatively  nonspecific  targeting  to  cells,  complementary 
biological  interactions  offer  a  means  to  selectively  target  PS  to 
specific microbes.45-47 S130C/K42R-PS was specifically targeted
to protein A (SpA) presented in the cell wall of S. aureus by
using streptavidin (StAv) to couple biotinylated anti-SpA
monoclonal antibody (anti-SpA mAb-B) to biotinylated
nanoplatform (Figure 2a). SpA binds to the Fc region of IgG and may
play  a  role  in  immune  avoidance.48  We  used  flow  cytometry  to 
characterize  expression  of  SpA  in  the  S.  aureus  ATCC  12598 
strain  cultured  under  our  conditions  (see  Supporting  Information, 
Figure  4S). 

A biotin targeting ligand was added to the endogenous lysines
of S130C/K42R-PS to produce S130C/K42R-PS-B. The S130C/K42R
construct was chosen for dual functionalization with PS
and the biotin functional group in order to minimize steric
interference by the relatively large PS functional group with
binding of StAv to the biotin targeting ligand. Preliminary
results indicated that one
of the most reactive endogenous lysines (K87) is located near
the pseudo 3-fold axis of the CCMV viral capsid, which is
well-spaced from the reactive cysteines on S130C/K42R. Covalent
linkage  of  the  biotin  functional  groups  into  the  S130C/K42R- 
PS-B  construct  was  confirmed  by  LC/MS  (see  Supporting 
Information,  Figure  3S). 

The  presence  of  PS-conjugated  nanoplatform  on  targeted  cells 
was first verified by epi-fluorescence microscopy (Figures 2b-2d).
Targeted cells appeared red while cells exposed to PS-conjugated
nanoplatform without targeting produced a faint green
intrinsic fluorescence which was only visible in images acquired
at sufficiently long exposure times. The red fluorescence from
cells targeted using electrostatic interactions was clearly
fainter than the fluorescence from cells targeted using
complementary biological interactions (antibody-SpA). The red
background in the images originates from free PS-conjugated
nanoplatform in solution. The presence of PLL probably
precipitates a portion of the nanoplatform in solution, causing
the  diffuse  background  in  Figure  2b  to  be  fainter  than  that  in 
Figure  2c. 

The arrangement of targeted nanoplatform on the cell wall
was characterized directly by FESEM (Figure 3). The FESEM
images indicated that the cell wall associated nanoplatforms
maintained their integrity. Targeting via electrostatic interactions
produced sparser coverage than targeting via complementary
biological interactions; the latter also produced more uniform
coverage. Twenty images were acquired of cells targeted via
complementary biological interactions. The image presented in
Figure 3c is representative of the high density of coverage
exhibited by these cells (see Supporting Information, Figure 5S).
The level of nonspecific binding of nontargeted
PS-functionalized nanoplatform (S102C/K42R-PS) shown in
Figure 3a is representative of the low level observed for
S130C/K42R-PS as well (see Supporting Information, Figure 6S).

Cell  populations  targeted  with  the  PS-conjugated  CCMV 
constructs  via  both  electrostatic  and  complementary  biological 
interactions  were  photodynamically  inactivated  to  a  significant 
level  (Figures  4-6).  Cells  targeted  with  S102C/K42R-PS  via 
electrostatic  interactions  mediated  by  PLL  were  photodynamically 
inactivated  at  a  very  low  fluence  rate  (Figure  4)  as  well  as  at  a 
more  standard  fluence  rate  and  standard  total  fluence  levels  (Figure 
5). Without the PLL interlayer, PS conjugated to the nanoplatform
was almost ineffectual in killing cells: nontargeted PS-conjugated
nanoplatforms (S102C/K42R-PS and S130C/K42R-PS) did not
induce appreciable photodynamic killing under conditions in
which free PS in solution reduced viable cell numbers by 5
orders of magnitude (see Supporting Information, Figure 7S).
Results presented in Figures 4 and 5 show that a PLL interlayer
on the cell wall brought the killing activity of PS
conjugated to the nanoplatform to within the range of free PS.
These  results  demonstrate  that  PS  at  concentrations  that  are 
normally  cidal  upon  light  activation  is  innocuous  when  the  PS 
is  conjugated  to  the  CCMV  nanoplatform  (but  not  targeted  to 
cells),  while  targeted  PS-conjugated  CCMV  can  be  used  to  obtain 
high  local  PS  concentrations  which  are  cidal. 

The killing activity of S102C/K42R-PS targeted via electrostatic
interactions is likely attributable to a combination of
membrane disruption by PLL and ROS production by PS located
proximal to the cell membrane. Under dark conditions, PLL at
the concentration used for these experiments (6.25 µg mL-1)
reduced the viable cells by about half an order of magnitude,
probably by compromising the cell membrane (Figures 4 and
5c). Association of S102C/K42R-PS with cells pre-exposed to
PLL significantly reduced the number of viable cells compared
to exposure to PLL alone for experiments at the low fluence rate
(8  independent  replicates,  paired  t-test,  p  value  0.006)  (Figure 
4).  This  same  trend  was  observed  at  a  more  standard  fluence  rate 
and  standard  total  fluence  levels  (Figures  5a  and  5c).  The 
association  of  PLL  with  the  cell  wall  also  enhanced  the  killing 
activity  of  free  PS  in  solution  (compare  Figure  5b  and  Figure 
6c) as anticipated from previous results.42,43 The dependence of

Figure 3. FESEM images showing the arrangement of targeted
nanoplatforms  on  the  cell  wall:  (a)  cell  exposed  to  S102C/K42R 
with  no  targeting;  (b)  cell  targeted  with  S102C/K42R-PS  via 
electrostatic  interactions;  (c)  cells  targeted  with  S130C/K42R-PS 
via  complementary  biological  interactions;  (d)  magnified  view  of 
the  region  indicated  in  (b);  (e)  magnified  view  of  the  region  indicated 
in (c). Scale bars: (a, b) 200 nm; (c) 100 nm; (d, e) 28 nm
(the  diameter  of  CCMV).  Note:  CCMV  in  solution  adsorbs 
strongly  to  the  positively  charged  substratum  used  for  FESEM 
visualization.49 


the  killing  activity  on  light  fluence  in  the  absence  of  PS  exhibited 
in  Figure  5c,  and  much  less  prominently  in  Figure  5d,  suggests 
that  there  were  intrinsic  PS  associated  with  the  cell  wall  whose 
killing activity was also enhanced by PLL.50,51 The results suggest


(49) Suci, P. A.; Klem, M. T.; Douglas, T.; Young, M. Langmuir 2005, 21,
8686-8693.

(50) Nitzan, Y.; Salmon-Divon, M.; Shporen, E.; Malik, Z. Photochem.
Photobiol. Sci. 2004, 3, 430-435.


Figure  4.  Photodynamic  inactivation  of  S.  aureus  cells  targeted 
with  PS-functionalized  nanoplatform  (NP-PS)  via  electrostatic 
interactions  at  a  fluence  rate  of  0.763  mW  cm-2  (total  fluence  of 
9.6  J  cm-2).  Data  are  presented  as  the  ratio  of  CFU  for  illuminated 
cells  to  CFU  for  cells  maintained  in  the  dark.  Error  bars  are  standard 
errors for the mean of eight independent replicates. PLL + NP-PS:
cells exposed to PLL and targeted with S102C/K42R-PS at 2 µM
equivalent PS concentration; PLL + PS: cells exposed to PLL and
the photosensitizer (2 µM); PLL: cells exposed only to PLL; NT:
no treatment. A paired t-test indicated that the difference between
the means for the pair (PLL) and (PLL + NP-PS) was significant
at the 1% level, whereas the difference between the means for the
pair (PLL + PS) and (PLL + NP-PS) was not significant at the
10%  level.  (The  differences  between  the  mean  for  the  untreated 
condition  and  the  means  for  the  other  three  conditions  were  each 
significant  at  the  1%  level.) 

Figure 5. Photodynamic inactivation of S. aureus cells targeted
with PS-functionalized nanoplatform (NP-PS) via electrostatic
interactions at a fluence rate of 46 mW cm-2. Data are presented
as CFU mL-1 recovered after treatment. Error bars are standard
errors for the mean of two independent replicates. (a) Cells exposed
to PLL and targeted with S102C/K42R-PS at 2 µM equivalent PS
concentration; (b) cells exposed to PLL and the photosensitizer
(2 µM); (c) cells exposed only to PLL; (d) no treatment.


that  the  killing  activities  conferred  by  a  cationic  polymer  or 
peptide  and  a  PS  both  conjugated  to  a  nanoplatform  would 
combine  to  produce  significant  levels  of  cell  inactivation. 

S130C/K42R-PS-B targeted to S. aureus cells using complementary
biological interactions produced levels of photodynamic


(51) Kafala, B.; Sasarman, A. Gene 1997, 199, 231-239.

Figure 6. Photodynamic inactivation of S. aureus cells targeted
with PS-functionalized nanoplatform (NP-PS) via complementary
biological interactions at a fluence rate of 46 mW cm-2. Data are
presented as CFU mL-1 recovered after treatment. Error bars are
standard errors for the mean of three independent replicates. (a)
Cells targeted with S130C/K42R-PS-B at 2 µM equivalent PS
concentration; (b) cells exposed to S130C/K42R-PS at 2 µM
equivalent PS concentration (nontargeted); (c) cells exposed to 2
µM PS; (d) cells targeted with anti-SpA-mAb-B and StAv but not
with the PS-functionalized nanoplatform. An unpaired t-test indicated
that the means for the targeted cells were significantly different
from means for all other conditions at fluences of 27.6 and 55.2 J
cm-2 at the 5% level.


killing  that  were  significantly  greater  than  the  free  PS  in  solution 
(2 µM) and nontargeted S130C/K42R-PS at an equivalent PS
concentration  (Figure  6).  Data  presented  in  Figure  6  are  the  means 
and  standard  error  for  three  independent  experiments.  The 
reduction  in  viable  cells  at  27.6  and  55.2  J  cm-2  is  significantly 
greater  at  the  5%  level  of  confidence  for  cells  targeted  with 
S130C/K42R-PS-B  compared  with  cells  exposed  to  S130C/ 
K42R-PS  (not  targeted)  and  cells  exposed  to  PS  alone  (unpaired 
t-test). 

The level of killing produced in cells targeted via complementary
biological interactions (Figure 6a) was approximately
the  same  as  that  achieved  by  targeting  SpA  in  S.  aureus  with  an 
antibody  conjugated  to  a  PS  under  comparable  conditions  of 
both  light  fluence  levels  and  equivalent  PS  concentration.47  There 
were approximately 70 PS delivered per binding event for CCMV
targeted  via  complementary  interactions,  while  only  about  2  PS 
per  binding  event  were  delivered  using  direct  conjugation  to  the 
antibody.  Therefore,  the  level  of  killing  obtained  with  the  targeted 
nanoplatform  might  be  expected  to  exceed  that  obtained  with 
PS-conjugated  antibody  if  killing  were  limited  by  PS  per  binding 
event.  One  possibility  is  that  the  relatively  large  size  of  CCMV 
significantly  reduced  the  proximity  of  PS  to  critical  cell 
components.  Proximity  is  essential  for  the  activity  of  singlet 
oxygen  which  has  a  very  short  diffusion  length  (<50  nm)  and 
is  thought  to  be  the  primary  ROS  causing  oxidative  damage.52 
We  have  not  attempted  to  construct  a  detailed  model  of  binding 
of  CCMV  to  SpA  using  our  scheme  (Figure  2),  but  approximate 
modeling  indicates  that  the  distance  between  PS  coupled  to 


(52) Ochsner, M. J. Photochem. Photobiol. B-Biol. 1997, 39, 1-18.


CCMV  that  is  bound  to  cells  and  the  cell  membrane  was  less 
than  50  nm  (Supporting  Information,  Figure  8S). 

The  killing  activity  might  have  been  limited  by  the  distribution 
of  SpA  expression  in  the  cell  population,  rather  than  the  quantity 
of  PS  delivered  per  binding  event.  This  was  proposed  as  a  primary 
constraint  on  killing  in  the  case  of  targeting  with  PS-conjugated 
antibody.47  The  Cowan  I  strain  used  in  our  experiments  is  known 
to  express  SpA  at  a  high  level.38  However,  according  to  the  flow 
cytometry  analysis  the  level  of  expression  of  SpA  in  a  small 
portion  of  the  cell  population  was  very  low  or  negligible  (see 
Supporting  Information,  Figure  4S).  Our  results  indicate  that  the 
majority  of  these  nontargeted  cells  would  survive  exposure  to 
the  fluence  levels  used  in  our  experiments.  The  PS  loading 
advantage  offered  by  a  targeted  CCMV  nanoplatform  (compared 
to  a  smaller  biomolecule  such  as  an  antibody)  might  have  been 
more  evident  in  a  case  in  which  epitope  presentation  in  the  cell 
wall  was  more  sparse,  but  uniformly  distributed  among  the  entire 
cell  population. 

In  addition  to  offering  the  possibility  of  delivering  substantial 
PS  per  binding  event,  the  relatively  large  surface  area  of 
nanoplatforms  provides  space  for  decoration  with  multiple 
targeting  ligands.  The  ability  to  engineer  placement  of  multiple 
targeting  ligands  on  a  PS  delivery  vehicle  opens  up  the  possibility 
of  finely  tuning  selectivity.  This  capability  could  be  used  as  a 
means  to  eliminate  pathogens  while  preserving  beneficial 
commensal microbes for treatment of oral infections,11 or to
ensure  selective  eradication  of  genetic  or  physiological  variants 
of  a  pathogen  causing  a  recalcitrant  wound  infection.12  In  the 
case  of  S.  aureus  there  are  at  least  22  different  cell  wall  anchored 
proteins  that  are  involved  in  conferring  virulence  and  could 
potentially be targeted with appropriately functionalized
nanoplatforms.53,54

For  topical  applications  the  indirect  targeting  scheme  used  in 
our  experiments  would  be  feasible.  Similar  strategies,  involving 
pretargeting  with  StAv,  have  been  proposed  even  for  intravenous 
applications.55  Protocols  for  biotinylation  of  antibodies  that 
preserve  binding  affinity  of  the  active  site  are  well-established, 
and  our  results  demonstrate  that  site-specific  dual  labeling  of 
CCMV  genetic  constructs  with  both  a  functional  biotin  and  an 
active  PS  is  manageable.  In  this  context,  nanoplatforms  such  as 
protein  cage  architectures  provide  an  accessible  template  for 
implementing  a  modular  approach  to  selective  PDT.  The 
advantage of this alternative approach is that strategies for
targeting  multiple  ligands  and  optimizing  PS  loading  per  binding 
event  could  be  developed  independently  and  then  coupled  via 
the  nanoplatform. 


Conclusion 

Our results demonstrate the feasibility of using PS-functionalized
nanoplatforms based on genetically modified CCMV as
targeted  delivery  vehicles  for  selective  antimicrobial  PDT.  If  the 
requirement  for  selectivity  is  not  too  stringent,  electrostatic 
interactions  can  be  used  to  target  PS-functionalized  CCMV  to 
microbial  pathogens.  In  this  case  relatively  dilute  concentrations 
of  the  cationic  polymer  significantly  enhance  the  photodynamic 
killing.  To  achieve  true  selectivity,  complementary  biological 
interactions  can  be  used.  The  simplest  strategy  in  this  case  is  to 
dual  functionalize  the  nanoplatform  with  a  biotin  and  the  PS. 


(53) Marraffini, L. A.; DeDent, A. C.; Schneewind, O. Microbiol. Mol. Biol.
Rev. 2006, 70, 192-221.

(54) Harris, L. G.; Foster, S. J.; Richards, R. G. Eur. Cells Mater. 2002, 4,
39-60.

(55) Wu, A. M.; Senter, P. D. Nat. Biotechnol. 2005, 23, 1137-1146.

The advantage of being able to use a combination of both chemical
and  genetic  methods  to  control  functional  group  presentation 
lends  protein  cage  architectures  a  versatility  not  offered  by  most 
other  nanoplatforms.  This  versatility  can  be  exploited  to  pursue 
a  modular  approach  to  design  vehicles  for  selective  antimicrobial 
PDT  in  which  PS  loading  and  targeting  functions  can  be 
independently  optimized. 

Viral capsids have the potential for combined cell/tissue targeting,
drug  delivery,  and  imaging.  Described  here  is  the  development  of 
a  viral  capsid  as  an  efficient  and  potentially  relevant  MRI  contrast 
agent.  Two  approaches  are  outlined  to  fuse  high  affinity  Gd3+ 
chelating  moieties  to  the  surface  of  the  cowpea  chlorotic  mottle 
virus  (CCMV)  capsid.  In  the  first  approach,  a  metal  binding  peptide 
has  been  genetically  engineered  into  the  subunit  of  CCMV.  In  a 
second  approach  gadolinium-tetraazacyclododecane  tetraacetic 
acid (GdDOTA) was attached to CCMV by reactions with endogenous
lysine residues on the surface of the viral capsid. T1 and T2
ionic relaxivity rates for the genetic fusion particle were R1 = 210
and R2 = 402 mM-1 s-1 (R2 at 56 MHz) and for CCMV functionalized
with GdDOTA were R1 = 46 and R2 = 142 mM-1 s-1 at 61 MHz. The
relaxivities per intact capsid for the genetic fusion were R1 =
36,120 and R2 = 69,144 mM-1 s-1 (R2 at 56 MHz) and for the
GdDOTA CCMV construct were R1 = 2,806 and R2 = 8,662 mM-1 s-1
at 61 MHz. The combination of high relaxivity, stable Gd3+
binding, and large Gd3+ payloads indicates the potential of viral
capsids  as  high-performance  contrast  agents.  Magn  Reson  Med 
58:871-879,  2007.  ©  2007  Wiley-Liss,  Inc. 

Key  words:  MRI;  contrast  agent;  viral  capsids;  cowpea  chlorotic 
mottle  virus;  gadolinium 
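
The per-ion and per-capsid relaxivities quoted in the abstract are related by the number of Gd3+ ions bound per capsid. The Python sketch below backs out that implied loading from the ratio of the two values. The loadings it prints are inferred here from the stated numbers and are not themselves given in the abstract; the dictionary keys and function name are illustrative.

```python
def gd_per_capsid(per_capsid_relaxivity: float, ionic_relaxivity: float) -> float:
    """Implied Gd3+ ions per capsid: per-capsid relaxivity / per-ion relaxivity."""
    return per_capsid_relaxivity / ionic_relaxivity


# (ionic, per-capsid) relaxivities in mM^-1 s^-1, taken from the abstract.
constructs = {
    "genetic fusion": {"R1": (210.0, 36120.0), "R2": (402.0, 69144.0)},
    "GdDOTA-CCMV": {"R1": (46.0, 2806.0), "R2": (142.0, 8662.0)},
}

for name, values in constructs.items():
    for label, (ionic, capsid) in values.items():
        n = gd_per_capsid(capsid, ionic)
        print(f"{name} {label}: {n:.0f} Gd3+ per capsid")
```

For each construct, R1 and R2 give the same implied loading, which is a useful internal consistency check on the reported values.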

MRI  is  one  of  the  most  utilized  imaging  techniques  in 
medicine since it is noninvasive and provides comparatively
high-resolution information. The usefulness of the
technique is augmented by the use of contrast agents that
increase the rate of water proton relaxation and can therefore
increase contrast between tissues. Gadolinium (Gd3+)
chelates are commonly used as contrast agents in clinical
settings (1,2). In general, there are two ways to improve the
imaging sensitivity using contrast agents: either by
increasing the relaxivity of water protons through direct
interaction  with  the  contrast  agent  or  by  targeted  delivery 
of  the  agent  to  specific  locations  within  the  body. 

Viral  capsids  are  multimeric  protein  assemblies  that 
form  cage  architectures  and  can  be  generally  categorized  as 
protein cages. Other, nonviral protein cages include heat
shock proteins, ferritins, and vault ribonucleoprotein
particles, among others. These protein cages can serve as
robust synthetic platforms that are chemically and genetically
malleable and can be readily modified. Previous
studies have explored the use of protein cages as therapeutic
or imaging delivery agents (3-5). Cell targeting has been
achieved by utilizing capsids with natural affinities for
cellular receptors or by chemically linking peptides or
antibodies to protein cage architectures (6,7). In addition,
targeted protein cages incorporating a therapeutic payload
(doxorubicin) have been constructed, demonstrating the
multifunctional capacity for biomedical applications (8).

Protein cages, and more specifically viral capsids, have the potential
to serve as extremely efficient contrast agents for the following
reasons: 1) viral capsids are large, commonly between 18 and 100 nm in
diameter, and relatively rigid molecular structures with large
rotational correlation times, resulting in increased relaxivity rates;
2) viral capsids can serve as robust platforms onto which multiple
functional motifs can be added through genetic or chemical
modifications (9-16), which could potentially result in the attachment
of both Gd3+ binding and site-specific targeting functionalities; and
3) viral capsids can potentially carry hundreds (if not thousands) of
Gd3+ ions, and the contrast from an individual capsid will increase
significantly with the number of Gd3+ ions it carries. For these three
reasons, viral capsids have been investigated as MRI contrast agents,
including the cowpea chlorotic mottle virus (CCMV) capsid with Gd3+
bound at endogenous metal binding sites and the MS2 virus capsid with
gadolinium diethylenetriamine pentaacetic acid (Gd-DTPA) chemically
attached (4,17). Dendrimers, liposomes, and other supramolecular
structures maintain properties 2 and 3 mentioned above and therefore
have also been developed as potential contrast agents (18,19).

In this study, potential MRI contrast agents based on the multivalent
protein cage architecture of the CCMV capsid have been developed. The
CCMV viral capsid is assembled from 180 identical protein subunits.
The capsid forms an icosahedral architecture through noncovalent
interactions between the subunits. A consequence of the cage-like
assembly is that amino acid residues are organized and presented in a
repeating and symmetrical fashion over the capsid surface. Therefore,
genetic and chemical modifications of amino acids within the subunit
result in the presentation of the modification at all 180
symmetrically related sites on the viral capsid. The exterior diameter
of the CCMV capsid is 28 nm and the interior diameter is 22 nm
(Fig. 1) (20).

CCMV has been shown to contain an endogenous metal binding site. In a
previous study characterizing the metal binding by fluorescence
resonance energy transfer (FRET), tryptophan residues proximal to the
endogenous metal binding site were used to determine Tb3+, Gd3+, and
Ca2+ dissociation constants (Kd = 19 µM, 31 µM, and 2 mM,
respectively) (21). Further studies indicated that Gd3+-bound CCMV
(CCMV-Gd) capsids exhibited high ionic relaxivity rates (R1 = 202,
R2 = 376 mM-1 s-1, at 61 MHz) and extremely high relaxivity rates per
capsid (R1 = 28,482 and R2 = 53,016 mM-1 s-1, 61 MHz) (17). However,
Gd3+ binding to the endogenous sites was too weak for the CCMV-Gd to
be useful as a clinical MRI contrast agent, since free Gd3+ is toxic
in vivo (22).

FIG. 1. The 28-nm viral capsid of CCMV is made of 180 individual
subunit proteins (20 hexamers and 12 pentamers).

The purpose of this study was to take advantage of the CCMV
architecture while enhancing the Gd3+ binding constant in the
development of two nanoparticle contrast agents. The first approach
was to genetically incorporate a nine-residue peptide sequence, from
the Ca2+ binding protein calmodulin, as a genetic fusion to the
N-terminus of the CCMV subunit (CCMV-CAL). Characterization of the
metal binding to the genetically engineered capsid was undertaken
using FRET analysis. The second approach was to covalently attach the
clinically relevant contrast agent gadolinium-tetraazacyclododecane
tetraacetic acid (GdDOTA) to reactive lysine residues on CCMV via an
N-hydroxysuccinimide (NHS) ester coupling reaction (CCMV-DOTA). Both
R1 and R2 relaxivity data were measured as a function of field
strength for the genetically and chemically modified viral capsid
contrast agents.

MATERIALS  AND  METHODS 

Engineering  CCMV  to  Express  the  Metal  Binding 
Sequence  of  Calmodulin 

The SubE/R26C/K42R gene, a mutant of the coat protein of the CCMV
capsid cloned into the Pichia pastoris vector pPICZA (Invitrogen), was
used as the template (23). QuikChange site-directed mutagenesis
(Stratagene) using the primer

5'-cgaggaattcatgtctacagacaaagatggtgatggatggttagaattcgaagagggtgggggcgaagagaacgaggagaacac-3'

and its reverse complement was used to insert the calmodulin
deoxyribonucleic acid (DNA) sequence into the N-terminus of the coding
region of the capsid protein. The mutagenized vector from CCMV was
confirmed by DNA sequencing (Applied Biosystems). This modified CCMV
protein was called CCMV-CAL.
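
As a quick consistency check on the primer, the following minimal Python sketch translates it from its first ATG using the standard genetic code. The helper names (`CODON_TABLE`, `translate`, `PRIMER`) are illustrative and not from the original work; the primer string is the one quoted above with the line-break hyphen removed, which is an assumption about how the two printed lines join.

```python
# Standard genetic code packed in TCAG order (a common compact encoding).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Mutagenesis primer as quoted in the text (5' to 3').
PRIMER = (
    "cgaggaattcatgtctacagacaaagatggtgatggatggtta"
    "gaattcgaagagggtgggggcgaagagaacgaggagaacac"
)


def translate(dna, start=0):
    """Translate complete codons of `dna` beginning at index `start`."""
    dna = dna.upper()
    residues = []
    # Stop before any trailing partial codon.
    for i in range(start, len(dna) - 2, 3):
        residues.append(CODON_TABLE[dna[i:i + 3]])
    return "".join(residues)


start = PRIMER.upper().find("ATG")   # first start codon in the primer
peptide = translate(PRIMER, start)
print(peptide)
```

Under these assumptions the first 20 residues of the translation read MSTDKDGDGWLEFEEGGGEE, which matches the CCMV-CAL sequence listed with Fig. 2.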

Expression  and  Purification  of  CCMV-CAL 

The mutagenized CCMV capsid protein gene was expressed and purified in
a Pichia pastoris heterologous protein expression system as previously
described (23). High levels of coat protein expression were induced
and yielded assembled viral protein capsids devoid of nucleic acid.

These viral capsids were purified to near homogeneity by lysis of
cells, followed by ion exchange chromatography. Size-exclusion
chromatography (SEC) was used to further purify the capsid and to
eliminate any aggregates or subunit disassembly products potentially
present in the samples (Superose 6; Amersham Biosciences; 50 mM
4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), pH 6.5).
Both ion exchange and SEC were performed on an Amersham Akta purifier
FPLC.

The 12-residue binding sequence and three glycine residues, shown in
Fig. 2, replaced the original residues 4-18 of the SubE/R26C/K42R CCMV
mutant. This replacement was confirmed at the protein level by liquid
chromatography/mass spectrometry (LC/MS) of the purified protein
(Agilent Technologies 1100 LC system coupled to an Esquire 3000 ion
trap mass spectrometer, Bruker Daltonics). The theoretical mass was
calculated by considering loss of the N-terminal methionine and
acetylation of the second residue (serine). Protein concentration was
determined by the absorbance at 280 nm (ε = 29,280 M-1 cm-1 for
CCMV-CAL; ε values were calculated by inserting amino acid sequences
into the ProtParam tool at http://ca.expasy.org/tools/protparam.html).

             1          11
Unmodified:  MSTVGTGELT EAQEEAAAEE
CCMV-CAL:    MSTDKDGDGW LEFEEGGGEE

FIG. 2. Inside view of CCMV's viral capsid. Blue highlights are
residue 27 in the six-fold environment. Red highlights are residue 42
in the five-fold environments. The first 20 amino acids are shown for
both the unmodified viral subunit and genetically modified subunit.
The underlined residues are responsible for metal binding.

Synthesis of CCMV-DOTA Capsids

The following buffers were used in the synthesis of CCMV-DOTA:
labeling buffer (100 mM HEPES, 100 mM NaCl, pH 7.2) and storage buffer
(100 mM HEPES, 100 mM NaCl, pH 6.5). A lysine-reactive form
(NHS-ester) of the metal chelator DOTA was used in the synthesis
(Macrocyclics, B-280). Figure 3 outlines the general reaction scheme.
The K42R mutant of the CCMV virus particle (0.5-3 mg/ml, 25-150 µM
subunit) was purified from infected plants (as previously described)
and dialyzed into 500 mL of labeling buffer for 3 h (24).

FIG. 3. The reaction scheme to attach DOTA-Gd3+ to the CCMV viral
capsid. Endogenous lysines on the viral capsid are reacted with a
DOTA/NHS conjugate. Next, GdCl3 is added to produce a viral capsid
conjugated with Gd3+ ions.

The concentration of plant virus was calculated by multiplying the
absorbance at 260 nm by 6.4 to yield a concentration of CCMV subunit
in µM units. A concentration of 1 to 2 mg/ml (~50 to 100 µM subunit)
was typically used in the reaction. A 20-fold molar excess (mole:mole)
of NHS-ester DOTA was added next. The pH was maintained at 7.0 by
additions (1-5 µL) of 0.5 M NaOH. The reaction mixture was monitored
by LC/MS (both Standard and Nano Aquity LC systems and both Q-Tof
Micro and Q-Tof Premier mass spectrometers were used). SEC and
reverse-phase separation techniques were used. The deconvolution
program MaxEnt1 (Waters) was used to determine the percent of subunits
with DOTA covalently linked to them. Equation [1] was used to
approximate the average labeling of subunits with DOTA, where D is the
number of DOTAs attached to the subunit and I_D is the intensity of
the ion corresponding to a subunit with D DOTAs attached.


$$\frac{\text{Average DOTA}}{\text{Subunit}} = \frac{\sum_{D} D \cdot I_{D}}{\sum_{D} I_{D}} \qquad [1]$$

The reaction was allowed to proceed for 2 h at 25°C. At that point,
LC/MS analysis revealed that the majority of the NHS-DOTA reactant was
hydrolyzed. The reaction was repeated (with additions of NHS-ester
DOTA) until there was, on average, one DOTA covalently attached per
subunit. Unreacted DOTA was removed by dialyzing the reaction mixture
into labeling buffer. Next, the CCMV-DOTA was dialyzed into labeling
buffer with 10X GdCl3 (moles of Gd3+:moles of subunit) at pH 7.0. The
progression of the metal loading onto the CCMV-DOTA was monitored by
LC/MS. Free Gd3+ was separated from the CCMV-DOTA-Gd by dialyzing the
labeled capsid into storage buffer that contained 5 mM ethylene
diamine tetraacetic acid (EDTA), followed by multiple dialysis steps
into storage buffer. Alternatively, the CCMV-DOTA-Gd was separated
from free Gd3+ by running the reaction product over SEC using storage
buffer as the eluent.
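
The average labeling described for Eq. [1] is an intensity-weighted mean of the number of DOTAs per subunit. A minimal Python sketch follows; the function name and the peak intensities are hypothetical placeholders chosen for illustration, not measured values from this study.

```python
def average_dota_per_subunit(intensities):
    """Intensity-weighted mean number of DOTAs per subunit (Eq. [1]).

    `intensities` maps D (number of DOTAs attached) to I_D, the
    deconvoluted ion intensity of the subunit carrying D DOTAs.
    """
    total = sum(intensities.values())
    if total <= 0:
        raise ValueError("total ion intensity must be positive")
    return sum(d * i_d for d, i_d in intensities.items()) / total


# Hypothetical deconvoluted intensities (arbitrary units), illustration only.
example_peaks = {0: 30.0, 1: 45.0, 2: 20.0, 3: 5.0}
print(average_dota_per_subunit(example_peaks))
```

With these placeholder intensities the weighted mean works out to one DOTA per subunit, the labeling level at which the text says the reaction was stopped.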

Characterization  of  the  Modified  CCMV  Capsids 

Characterization of both purified capsids was performed by SEC,
dynamic light scattering (DLS), transmission electron microscopy
(TEM), and LC/MS. Typical data from these characterization techniques
are shown in Fig. 4. DLS analysis was performed on a ZetaPlus
(Brookhaven Instruments). TEM (Leo 912AB) was performed on negatively
stained samples (1% uranyl acetate).

Characterization of the GdDOTA Reaction Site in CCMV-DOTA-Gd

After removal of unbound Gd3+ from a solution of CCMV-DOTA-Gd,
inductively coupled plasma mass spectrometry (ICP-MS) analysis was
used to determine the total concentration of Gd3+ bound to the viral
capsid (7500; Agilent Technologies). Protein concentrations of the
CCMV-DOTA-Gd samples were determined by the bicinchoninic acid (BCA)
assay (Pierce). Protease digestion (Trypsin Gold; Promega) of
CCMV-DOTA-Gd was carried out (1 mg/ml protein subunit, 1:200
subunit:trypsin, 37°C, 12 h) and liquid chromatography/tandem mass
spectrometry (LC/MS/MS) was performed on the digested sample
(nanoAquity coupled to a Q-Tof Premier). Data were analyzed with PLGS2
and MassLynx (Waters).

FIG. 4. Data from routine virus capsid characterization of CCMV-CAL or
CCMV-DOTA. a: Size-exclusion chromatogram showing the 280-nm
absorbance of the three main components. The small left peak is
aggregated virus capsids. The large middle peak is intact capsids
eluting at the correct retention volume and the peaks to the right are
buffer molecules. b: DLS indicates a viral capsid mean diameter of
30 nm. c: Transmission electron micrograph of negatively stained viral
particles. d: The deconvoluted spectrum shows the correct subunit
mass. Inset is the raw electrospray mass spectrum of viral capsid
subunits.

Generation  of  Binding  Isotherms  for  CCMV-CAL 

The lanthanide ions Gd3+ and terbium (Tb3+) are known to bind with
similar affinities; however, Gd3+ does not undergo FRET and therefore
cannot be used to probe metal binding. Therefore, Tb3+ was used as a
Gd3+ mimic to study metal binding. All fluorescence experiments were
performed on a SPEX Fluorolog spectrophotometer at 25°C. The
fluorescence spectrum of CCMV-CAL-Tb was measured (λmax = 340 nm)
following excitation at 295 nm, with a Tb3+ emission maximum near
550 nm. The 340-nm peak was recorded by scanning from 305 nm to 575 nm
in 1-nm steps. The 550-nm peak was monitored by scanning from 525 nm
to 575 nm in 0.2-nm steps. Excitation and emission slit widths were
set to 4 and 8 nm, respectively. Fluorescence was measured on 500-µL
solutions of viral capsid to which metal ion was added in 5-20 µL
increments from 10-500 µM solution standards. Subunit protein
concentrations of 0.05 µM and 0.1 µM for the CCMV-CAL mutant were
used.

Calculation  of  Kd 

In analyzing the data for Tb3+ binding to the CCMV-CAL capsid, we have
assumed that when metal ions are titrated into the CCMV-CAL capsid,
they first bind completely to the inserted sites. Once the inserted
sites are completely occupied, the endogenous sites start to bind the
additional metal ions that are added. Equation [2] was used to analyze
the data, where θ is the fraction of capsid subunits with bound Tb3+,
[Tb3+_Free] is the free Tb3+ concentration, and Kd is the dissociation
constant for subunit binding Tb3+ ions.

$$\theta = \frac{[\mathrm{Tb}^{3+}_{Free}]}{[\mathrm{Tb}^{3+}_{Free}] + K_d} \qquad [2]$$

Baseline correction was performed on the 550-nm Tb3+ spectra. To find
the maximum intensity of the 550-nm peak, a Gaussian function was
individually fit to each spectrum. The maximum intensity of the 550-nm
peak (I550) vs. the total Tb3+ in solution ([Tb3+_Total]) was plotted
and fit to Eq. [3], where Kd_Initial and I550_Limit were the fitting
parameters.

$$I_{550} = I_{550,Limit} \left( \frac{[\mathrm{Tb}^{3+}_{Total}]}{[\mathrm{Tb}^{3+}_{Total}] + K_{d,Initial}} \right) \qquad [3]$$

Fraction-bound terms (θ) were calculated by dividing I550 values by
the I550_Limit determined from Eq. [3]. Values for [Tb3+_Free] were
then calculated by Eq. [4].

$$[\mathrm{Tb}^{3+}_{Free}] = [\mathrm{Tb}^{3+}_{Total}] - \left(\theta \, [\mathrm{subunit}_{Total}]\right) \qquad [4]$$

A plot of θ vs. [Tb3+_Free] was then fit with Eq. [2] and Kd was
determined, along with an error associated with the fit.
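
The analysis chain described above (intensity to fraction bound, Eq. [4] for free Tb3+, then a fit of Eq. [2]) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' fitting code: the one-parameter least-squares fit is done here with a golden-section search on log10(Kd), I550_Limit is taken as already known from the Eq. [3] fit, and the titration data are simulated from an assumed Kd rather than measured.

```python
import math


def free_tb(tb_total, theta, subunit_total):
    """Eq. [4]: free Tb3+ = total Tb3+ minus Tb3+ bound to subunits."""
    return tb_total - theta * subunit_total


def binding_isotherm(tb_free, kd):
    """Eq. [2]: fraction of subunits with bound Tb3+."""
    return tb_free / (tb_free + kd)


def fit_kd(tb_free, theta, kd_lo=1e-4, kd_hi=1e2, iterations=200):
    """Least-squares Kd for Eq. [2], golden-section search on log10(Kd)."""
    def sse(log_kd):
        kd = 10.0 ** log_kd
        return sum((binding_isotherm(f, kd) - t) ** 2
                   for f, t in zip(tb_free, theta))

    a, b = math.log10(kd_lo), math.log10(kd_hi)
    g = (math.sqrt(5.0) - 1.0) / 2.0
    c, d = b - g * (b - a), a + g * (b - a)
    for _ in range(iterations):
        if sse(c) < sse(d):
            b, d = d, c
            c = b - g * (b - a)
        else:
            a, c = c, d
            d = a + g * (b - a)
    return 10.0 ** ((a + b) / 2.0)


# Simulated titration (all concentrations in µM); values are assumptions.
KD_TRUE, SUBUNIT, I_LIMIT = 0.082, 0.1, 1000.0
totals = [0.02, 0.05, 0.1, 0.2, 0.4, 0.8]


def simulate_i550(tb_total):
    """Exact single-site mass balance, scaled to an intensity."""
    s = SUBUNIT + tb_total + KD_TRUE
    theta = (s - math.sqrt(s * s - 4.0 * SUBUNIT * tb_total)) / (2.0 * SUBUNIT)
    return I_LIMIT * theta


thetas = [simulate_i550(t) / I_LIMIT for t in totals]       # I550 / I550_Limit
frees = [free_tb(t, th, SUBUNIT) for t, th in zip(totals, thetas)]
kd_fit = fit_kd(frees, thetas)
print(f"fitted Kd = {kd_fit:.4f} uM")
```

Because the simulated data are noise-free, the fit should return the Kd used to generate them; with real spectra the scatter in I550 would set the uncertainty of the fit.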


STOICHIOMETRIC TITRATION OF CCMV-CAL

Terbium ions (Tb3+) were used as Gd3+ ion mimics. Tb3+ titrations were
performed with the condition of [subunit] >> Kd to determine the
number of Tb3+ bound per CCMV-CAL subunit. Two titrations were
performed in which the protein concentration used was 2.6 µM and
10 µM. The capsid was titrated to ~20 µM total Tb3+ in both
experiments. The data, from both the beginning and end portions of the
titration, were fit to linear functions.
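
The two-line analysis can be sketched as follows: fit a straight line to the rising part of the titration and another to the plateau, then solve for where they cross. This is a minimal Python illustration with synthetic numbers. The helper names, the choice of total Tb3+ as the x-axis, and all data values are assumptions made for the example, not data from this study.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def intersection_x(line_a, line_b):
    """x-coordinate at which two (slope, intercept) lines cross."""
    (m1, b1), (m2, b2) = line_a, line_b
    if m1 == m2:
        raise ValueError("parallel lines do not intersect")
    return (b2 - b1) / (m1 - m2)


# Synthetic titration: fluorescence vs. total Tb3+ (µM), illustration only.
subunit_conc = 10.0                       # µM subunit, assumed
rising_tb = [0.25, 0.5, 0.75, 1.0]        # [binding site] >> [metal]
rising_f = [25.0, 50.0, 75.0, 100.0]
plateau_tb = [5.0, 10.0, 15.0, 20.0]      # [metal] >> [binding site]
plateau_f = [143.75, 168.75, 193.75, 218.75]

tb_at_saturation = intersection_x(fit_line(rising_tb, rising_f),
                                  fit_line(plateau_tb, plateau_f))
subunits_per_tb = subunit_conc / tb_at_saturation
print(tb_at_saturation, subunits_per_tb)
```

The ratio of subunit concentration to the Tb3+ concentration at the crossing point gives the number of subunits per bound ion for whatever data are supplied.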


RELAXOMETRY  AND  GD3+  QUANTITATION 

For relaxometry experiments, fully assembled CCMV-CAL (60 µM subunit)
containing 200 µM GdCl3 was prepared in pH 6.5 buffer (50 mM HEPES,
150 mM NaCl). As a control, 200 µM GdCl3 was prepared in the same
buffer. CCMV-DOTA-Gd was prepared in pH 6.5 buffer (100 mM HEPES,
100 mM NaCl) with a subunit concentration of 101 µM (determined by the
BCA assay) and 34 µM Gd3+ (determined by ICP-MS). Using a
custom-designed variable field relaxometer, T1 relaxivity was measured
using a saturation-recovery pulse sequence with 32 incremental t
values. The range of Larmor frequencies was 2-62 MHz (0.05-1.5 T) and
the measurements were carried out at a temperature of 23°C. T1 values
were determined by fitting data into Eq. [5] with A and B as fitting
parameters.

T2 was measured using a CPMG pulse sequence with 500 echoes and an
interecho time of 2 ms. T2 values were determined by fitting data into
Eq. [6] with A, B, and N as fitting parameters.

Since the R1 and R2 relaxivities are expressed in units of mM-1 s-1
(per mM of bound Gd3+), it was necessary to determine the mM
concentration of bound Gd3+. The calculation of the fraction of
CCMV-CAL with bound Gd3+ is an approximation since this capsid
contains two types of binding sites. First, the parameters for the
higher affinity binding site or “CAL” were input into Eq. [7]. The
concentration of binding sites [BS], [Gd3+_Total], and the Kd are all
values input into this equation.


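$$\theta = \frac{\left([BS] + [\mathrm{Tb}^{3+}_{Total}] + K_d\right) - \sqrt{\left([BS] + [\mathrm{Tb}^{3+}_{Total}] + K_d\right)^2 - 4[BS][\mathrm{Tb}^{3+}_{Total}]}}{2[BS]} \qquad [7]$$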
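
Equation [7] is the physically meaningful root of the quadratic mass balance for a single class of binding sites, so it should agree with Eq. [2] once the free metal concentration is computed from the result. The short Python sketch below checks this. The function name and the input concentrations are illustrative assumptions, not the values used in the study.

```python
import math


def fraction_bound(binding_sites, metal_total, kd):
    """Eq. [7]: fraction of binding sites occupied.

    All three arguments must be in the same concentration units.
    """
    s = binding_sites + metal_total + kd
    root = math.sqrt(s * s - 4.0 * binding_sites * metal_total)
    return (s - root) / (2.0 * binding_sites)


# Illustrative inputs in µM (assumed values, not from the paper).
bs, metal, kd = 10.0, 20.0, 0.082
theta = fraction_bound(bs, metal, kd)

# Consistency with Eq. [2]: theta should equal free / (free + Kd).
free = metal - theta * bs
theta_from_eq2 = free / (free + kd)
print(theta, theta_from_eq2)
```

The same function can be called a second time with the parameters of a weaker site class, using the metal left over from the first call as the new total.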


From this, the fraction-bound term for the CAL binding site (θ_CAL)
could be determined and was further used (Eq. [8]) to calculate the
[Gd3+_Free] left after the CAL site was maximally bound.

$$[\mathrm{Gd}^{3+}_{Free}] = [\mathrm{Gd}^{3+}_{Total}] - \left([CAL] \cdot \theta_{CAL}\right) \qquad [8]$$

The fraction of Gd3+ bound to the endogenous site (θ_Endogenous) was
calculated by setting [Gd3+_Total], which was input into Eq. [7],
equal to [Gd3+_Free], which was determined by Eq. [8]. Kd and [BS]
values for the endogenous site were input into Eq. [7] to determine
the fraction bound for the endogenous site (θ_Endogenous). Finally,
the fraction-bound terms for both the endogenous site (θ_Endogenous)
and the engineered site (θ_CAL) were multiplied by the millimolar
concentration of their respective binding sites and then added
together, resulting in a millimolar concentration of total bound Gd3+.

FIG. 5. Binding isotherms for CCMV-CAL binding Tb3+ ions (x-axis: free
Tb3+, µM). The open triangle, square, and circle represent three
replicate experiments for the increase of the 550-nm peak. The solid
triangle represents the decrease in the 340-nm peak from the first
experiment. All four data sets were used for the fit.

RESULTS

CCMV Modified Architecture

The CCMV viral capsid (Fig. 1) has been modified for enhanced Gd3+
binding, using two complementary approaches, while maintaining the
advantages of the large molecular platform. In the first approach, a
Gd3+ binding peptide from calmodulin was genetically introduced onto
the N-terminus of the CCMV viral capsid subunit (CCMV-CAL-Gd). This
modified viral capsid has an increased affinity for Gd3+ in comparison
with wild-type CCMV. In a second approach, GdDOTA was conjugated to
CCMV, resulting in high-affinity Gd3+ binding and imparting highly
efficient relaxivity properties to the CCMV capsid (CCMV-DOTA-Gd).

Genetic Modification of CCMV: Attachment of Calmodulin Peptide


A peptide sequence from the Ca2+ binding portion of the protein
calmodulin (DKDGDGWLEFEEGGG) was genetically fused to the N-terminus
of CCMV (Fig. 2). Interestingly, this construct with a nine-residue
peptide incorporated as an N-terminal fusion did not disrupt the
ability of CCMV to self-assemble, as shown by SEC, DLS, and TEM
analyses (Fig. 4). The mutation of the coat protein subunit gene was
confirmed by DNA sequencing. LC/MS of the purified viral capsid
subunit produced an experimental average mass of 20,234 Da compared to
a calculated average mass of 20,232 Da (Supplemental Data) for the
mutant protein subunit, confirming the identity of the recombinant
protein.

Kd  Determination  of  CCMV-CAL 

The metal binding affinity of the CCMV-CAL mutant was probed by FRET
using excitation of endogenous tryptophan residues. The lanthanide
ions Gd3+ and terbium (Tb3+) are known to bind with similar affinities
and both show preference for Ca2+ binding sites in proteins, although
with significantly higher affinities than Ca2+ binding (17,25).
Titration of the CCMV-CAL mutant with increasing Tb3+ revealed a
decrease in the tryptophan fluorescence (340 nm) and concomitant
increase in the Tb3+ fluorescence (at 550 nm), indicating energy
transfer between these sites (Fig. 5). The complete data set was fit
to Eq. [2] and an average Kd of 82 ± 14 nM for Tb3+ binding to
CCMV-CAL was determined. This indicates an enhancement in the metal
binding affinity of 232-fold over binding to endogenous sites in the
wild-type CCMV (Kd of 19,000 nM for Tb3+) (21).

FIG. 6. A stoichiometric titration of CCMV-CAL with Tb3+ ions. Two
replicate data sets are shown here, one with circles and the other
with triangles.

FIG. 7. Typical deconvolved electrospray mass spectra of CCMV capsid
subunits. CCMV capsids were reacted with NHS-DOTA and then GdCl3.
Unlabelled subunits and subunits with one to three GdDOTA were
detected.


Stoichiometric  Titrations  of  CCMV-CAL 

A stoichiometric titration with Tb3+ was performed to determine the
number of ions bound per CCMV-CAL subunit. When normalized
fluorescence intensity was plotted against the ratio of subunit:Tb3+
([subunit]/[Tb3+_Total]) as shown in Fig. 6, the data show two
distinct regions. The fluorescence response during the first part of
the titration ([binding site] >> [metal]) increases linearly with Tb3+
addition. The second linear portion of the titration ([metal] >>
[binding site]) shows a relatively constant fluorescence response with
a smaller slope. The x-intersection of the fits to each of these two
regions indicates the point at which the binding sites are maximally
occupied. Our data indicate a value of 7.3 ± 1.1
([subunit]/[Tb3+_Total]) for this intersection. At the end of the Tb3+
titration, there is one Tb3+ ion for every 7.3 ± 1.1 subunits, or each
CCMV capsid has approximately 25 metal ions bound at the introduced
peptide sites.
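
The figure of roughly 25 ions per capsid follows from dividing the 180 subunits by the measured subunit:Tb3+ ratio. As a rough worked check (the range below simply takes the quoted ±1.1 at face value, which is an assumption about how the uncertainty propagates):

```latex
\[
N_{\mathrm{Tb/capsid}} \approx \frac{180\ \text{subunits}}{7.3\ \text{subunits per Tb}^{3+}} \approx 24.7 \approx 25,
\qquad
\frac{180}{8.4} \approx 21 \;\le\; N_{\mathrm{Tb/capsid}} \;\le\; \frac{180}{6.2} \approx 29 .
\]
```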

Chemical Modification of CCMV: Attachment of GdDOTA

GdDOTA was covalently attached to the surface of CCMV via reaction
with surface-exposed lysine residues (Fig. 3). A deconvoluted mass
spectrum after a routine CCMV-DOTA-Gd synthesis and removal of unbound
Gd3+ showed a distribution of 0 to 3 GdDOTA per subunit (Fig. 7). The
addition of a single GdDOTA added an experimental mass of 544 Da to
the subunit molecular weight, which corresponds well with the
theoretical value of 543.7 Da per GdDOTA. It was qualitatively
observed by LC/MS that up to two DOTA on average per subunit produced
a stable, fully assembled capsid yielding up to 360 GdDOTA per capsid.

Using mass spectrometry, we were able to map the lysines that are the
sites of labeling. LC/MS/MS analysis indicates that GdDOTA is
primarily attached through Lys 8 and Lys 45 (shown in Supplemental
Data). The distribution of GdDOTA labeling on CCMV-DOTA-Gd suggests
that there is not complete occupancy of either Lys 45 or Lys 8 within
the capsid and that additional unidentified residues are also labeled.
However, residues 8 and 45 are likely the most prevalently labeled
lysines in CCMV. Figure 8 shows the inside view of the CCMV capsid
with GdDOTA modeled onto residue 45. The position of lysine 45 in the
structure suggests that the attached GdDOTA resides on the interior of
the capsid. Residue 8 is not shown since the N-terminus is disordered
in the X-ray crystal structure of CCMV and its position is therefore
uncertain.

Relaxometry  of  CCMV-CAL-Gd  and  CCMV-DOTA-Gd 

Highly efficient T1 and T2 relaxivity properties were observed in both
capsids (CCMV-CAL-Gd and CCMV-DOTA-Gd). Table 1 summarizes R1 and R2
values for Gd3+/virus systems, including wild-type CCMV-Gd,
CCMV-CAL-Gd, CCMV-DOTA-Gd, and MS2-DTPA-Gd (4,17). The ionic R1 and R2
trends are CCMV-CAL-Gd > CCMV-Gd > CCMV-DOTA-Gd > MS2-DTPA-Gd. The R1
and R2 trends relative to the particle are CCMV-CAL-Gd > wild-type
CCMV-Gd > MS2-DTPA-Gd > CCMV-DOTA-Gd. In the genetic approach, the
CCMV-CAL mutant exhibited approximately the same T1 and T2 relaxivity
as wild-type CCMV-Gd previously reported (17). Figure 9a and b show
the similarities between the T1 and T2 relaxivity values of wild-type
CCMV-Gd and CCMV-CAL-Gd for field strengths ranging from 0.2 to 1.5 T.
This construct not only has an increased affinity for Gd3+, but also
maintains the very high relaxivity required for clinically relevant
contrast agents.

FIG. 8. Cutaway view of the interior of CCMV. GdDOTA was modeled to be
attached through Lys 45. The red highlighted view is a close-up of
Lys 45 in the six-fold environment while the blue highlighted view is
of Lys 45 in a five-fold environment.

Table 1
Relaxivities of Capsid-Gd3+ Constructs at 1.5 T

                                  r1 (s-1)     r2 (s-1)     r1 (s-1)         r2 (s-1)
Construct          Gd/particle^a  (mM Gd3+)-1  (mM Gd3+)-1  (mM particle)-1  (mM particle)-1
Wild-type CCMV-Gd  141^a          202          376          28,482           53,016
CCMV-CAL-Gd        172^a,b        210          402^c        36,120           69,144
CCMV-DOTA-Gd       61^a           46           142          2,806            8,662
MS2-DTPA-Gd        514^a          14           NA           7,196            NA
DOTA-Gd            1              3^d          5^d          3^d              5^d

a Gd/particle (mole/mole) of the samples used in the relaxometry experiment.
b Calculated by setting CAL-sites/Gd = 7.3.
c Relaxivity measured at 1.3 T.
d Relaxivity measured at 1.0 T.
NA = Not available.
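
The per-particle relaxivities in Table 1 are the ionic relaxivities multiplied by the number of Gd3+ ions per particle. The short Python sketch below re-derives them from the tabulated ionic values. The dictionary layout and function name are illustrative, and the Gd/particle entries (141, 172, 61, 514) are read from the table on the assumption that stray spaces in the extracted digits are artifacts.

```python
# (Gd per particle, ionic r1, ionic r2, particle r1, particle r2) from Table 1;
# None marks entries listed as "NA".
TABLE_1 = {
    "Wild-type CCMV-Gd": (141, 202, 376, 28482, 53016),
    "CCMV-CAL-Gd": (172, 210, 402, 36120, 69144),
    "CCMV-DOTA-Gd": (61, 46, 142, 2806, 8662),
    "MS2-DTPA-Gd": (514, 14, None, 7196, None),
}


def per_particle(ionic_relaxivity, gd_per_particle):
    """Relaxivity per mM particle = relaxivity per mM Gd3+ x Gd3+ per particle."""
    if ionic_relaxivity is None:
        return None
    return ionic_relaxivity * gd_per_particle


for name, (n_gd, r1, r2, r1_particle, r2_particle) in TABLE_1.items():
    calc_r1 = per_particle(r1, n_gd)
    calc_r2 = per_particle(r2, n_gd)
    print(f"{name}: r1 {calc_r1} (table {r1_particle}), "
          f"r2 {calc_r2} (table {r2_particle})")
```

Each product reproduces the tabulated per-particle value exactly, which also supports the reading of the Gd/particle column used here.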

The increased T1 and T2 relaxivity afforded by the chemical
conjugation of GdDOTA to CCMV's capsid is shown in Fig. 9c and d. This
conjugation resulted in increased T1 and T2 relaxivities by a factor
of 10 relative to free GdDOTA. By chemically attaching GdDOTA to the
CCMV capsid we have engineered a viral capsid with clinically relevant
binding and high relaxivity.

The R1 dependence on field strength, for both wild-type CCMV-Gd and
CCMV-CAL-Gd, varies in a manner typical of nanoscale systems with a
relaxivity maximum near 1 T (26). CCMV-DOTA-Gd has an R1 field
strength dependence more similar to small molecule systems with a
maximum near 0.5 T. All CCMV/Gd systems exhibit a positive correlation
between field strength and T2 relaxivity values, as expected.

FIG. 9. a: R1 values for two CCMV capsids conjugated with Gd3+ ions,
wild-type CCMV-Gd (circles) and CCMV-CAL-Gd (squares). b: R2 values
for two CCMV capsids conjugated with Gd3+ ions, wild-type CCMV-Gd
(circles) and CCMV-CAL-Gd (squares). c: R1 values for GdDOTA
(triangle) and CCMV-DOTA-Gd (squares). d: R2 values for GdDOTA
(triangle) and CCMV-DOTA-Gd (squares).

DISCUSSION 

The major achievement of this study is the demonstration that the CCMV
viral capsid is a robust platform for development of MR contrast
agents. Specifically, we have shown that a genetically engineered CCMV
capsid binds Gd3+ (CCMV-CAL-Gd) with a significantly higher affinity
than wild-type CCMV. In a second approach, the CCMV capsid has been
chemically modified, resulting in the attachment of multiple GdDOTA
(CCMV-DOTA-Gd). These amine-conjugated GdDOTA groups have been
previously shown to bind with a dissociation constant of 10^-20 M,
which corresponds to a 10^15-fold improvement in metal affinity
compared to wild-type CCMV-Gd (27). Both these nanoparticle contrast
agents are water soluble and have high ionic and particle
relaxivities. These viral capsids have multiple Gd3+ ions attached per
capsid, resulting in the potential for high contrast associated with
tissue-specific localization. Localization of the viral capsid could
be accomplished by active targeting or by passive means.

It is surprising that we observed substoichiometric binding
of Gd3+ to the introduced CAL peptide in the CCMV-CAL-Gd
construct. The data suggest that only 25 out of 180
sites bind Gd3+ under saturating conditions. A likely possibility
is that not all of the introduced sites are accessible to
metal ions because the CAL peptides occupy different chemical
environments. It is also worth mentioning that the N-termini
are grouped in two environments: 20 pseudo six-fold
and 12 five-fold environments (Fig. 2). Together, these
32 groupings contain an average of 5.6 subunits
per N-terminus grouping. This value is reasonably close to
the experimentally determined value of 7.3 ± 1.1 CAL
peptides per bound metal ion. It is possible that single
metal ions are bound by multiple CAL peptides grouped at
these five-fold and pseudo six-fold environments. The dissociation
constant of 82 ± 14 nM for CCMV-CAL-Gd3+, in
comparison to dissociation constants in the range of $10^{-20}$
M for approved contrast agents, indicates that the CAL
peptide binds Gd3+ too weakly for clinical application.
Therefore, this approach of genetically attaching metal-binding
peptides to the CCMV capsid is not likely to be as
fruitful as chemical modification approaches.
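
The site-counting argument above can be checked with a few lines of arithmetic. The following Python sketch uses only the numbers quoted in the text (180 subunits, 25 occupied sites, 20 pseudo six-fold and 12 five-fold N-terminal groupings); the variable names are ours and are not taken from the original work.

```python
# Numbers quoted in the text for the CCMV-CAL-Gd construct.
SUBUNITS = 180         # one introduced CAL peptide per capsid subunit
BOUND_GD = 25          # Gd3+ ions bound per capsid at saturation
PSEUDO_SIXFOLD = 20    # N-terminal groupings that contain six subunits
FIVEFOLD = 12          # N-terminal groupings that contain five subunits

# Consistency check: the groupings must account for every subunit.
assert PSEUDO_SIXFOLD * 6 + FIVEFOLD * 5 == SUBUNITS

# CAL peptides available per bound metal ion.
peptides_per_ion = SUBUNITS / BOUND_GD
# Average number of subunits per N-terminal grouping.
subunits_per_grouping = SUBUNITS / (PSEUDO_SIXFOLD + FIVEFOLD)

print(f"CAL peptides per bound Gd3+: {peptides_per_ion:.1f}")
print(f"Subunits per N-terminal grouping: {subunits_per_grouping:.1f}")
```

The first ratio (180/25) falls inside the quoted experimental range of 7.3 ± 1.1, and the second (180/32) is the 5.6 subunits per grouping given in the text.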




It was expected that all three CCMV/Gd constructs
(CCMV-Gd, CCMV-CAL-Gd, and CCMV-DOTA-Gd) would
have similar relaxivity values, since size was thought to be
the dominant factor in determining relaxivity rates. However,
CCMV-DOTA-Gd has ionic relaxivity values that are
approximately 25% of the values for CCMV-Gd and
CCMV-CAL-Gd, indicating that factors other than size can
influence the relaxivity of these capsids. The endogenous
Gd3+ binding pocket in wild-type CCMV is at the interface
of three subunits and contains side chains from each subunit.
The result of this intersubunit binding pocket is that
the overall motion of the Gd3+ ion is identical to the
overall motion of the entire capsid and there is no additional
motion of the Gd3+ ion. This is in contrast to the
DOTA-bound Gd3+ of CCMV-DOTA-Gd, in which the flexibility
of the linker connecting Gd to the capsid adds
additional motion to the Gd3+ ion. In addition, LC/MS/MS
data of a CCMV-DOTA-Gd trypsin digest indicate that the
GdDOTA is primarily attached at the N-terminal end of
the capsid subunit through Lys 8 and Lys 45. It is known
that the N-terminus of CCMV is mobile and can occupy
both the interior and exterior of the CCMV capsid architecture
(28,29). This increased local mobility of the region
of the viral capsid that is labeled with GdDOTA may cause
a reduction in relaxivity. Electron paramagnetic resonance
(EPR) spectroscopy studies of CCMV labeled with another
small molecule concluded that there was local mobility
within the spin label itself, which decreased the rotational
correlation time when compared to the predicted value for
the CCMV capsid (30-34). This local mobility of the label
may also exist in CCMV-DOTA-Gd, which would lead to
an additional decrease in relaxivity rates. Local mobility
from flexible regions of the protein as well as the GdDOTA
itself could explain the lower relaxivity values of CCMV-DOTA-Gd
compared to CCMV-Gd and CCMV-CAL-Gd.

The consideration of gadolinium's ligands offers another
explanation for the relaxivity differences between CCMV-DOTA-Gd
and the two other CCMV/Gd3+ systems
(CCMV-Gd and CCMV-CAL-Gd). Multiple water molecules
are thought to be ligands of Gd3+ in both CCMV-Gd
and CCMV-CAL-Gd, in contrast to the GdDOTA system,
where a single water molecule is coordinated to the chelated
Gd3+ ion. The increased number of bound water
molecules makes CCMV-Gd and CCMV-CAL-Gd more efficient
at relaxation of water molecules compared to
CCMV-DOTA-Gd. Also, it was found that a peptide similar
to the genetically fused sequence used in this work
resulted in unexpectedly high relaxivity values. The authors
of that work suggested that the high relaxivity values were a
result of having the Gd3+ coordinated primarily with oxygen
atoms, since oxygen ligand systems resemble the aqua Gd3+
complex, which has near optimal lifetimes for bound water
molecules (35-37). This ideal oxygen environment is
found in CCMV-Gd and CCMV-CAL-Gd, whereas in the
DOTA system one of the four carboxylate groups is converted
to an amide bond in the conjugation reaction. This
conversion increases the lifetime of the bound water molecule
from approximately 250 ns to greater than 1 ms,
which is further from the theoretical ideal lifetime of
20-30 ns (38). The larger value for the lifetime of bound
water in the DOTA system probably accounts for some
reduction of the relaxivity rate of CCMV-DOTA-Gd compared
to CCMV-CAL-Gd.

Another possible example of how local mobility can
affect relaxivity rates is revealed by comparing two different
Gd3+-chelated viral capsid systems (CCMV-DOTA-Gd
and MS2-DTPA-Gd). Anderson et al. (4) have attached up
to 500 Gd3+ ions to the MS2 capsid, yielding ionic
relaxivity rates that are approximately three times lower
than the T1 ionic relaxivity rates for CCMV-DOTA-Gd (Table
1). This deviation is larger than expected, since the
relaxivity difference between DOTA and DTPA is small,
the two capsids are the same size, and in both systems the
chelators are attached to endogenous lysines. Furthermore,
the isothiocyanate-linked DTPA results in a lifetime
for bound waters closer to the ideal value when compared
to the NHS ester-linked DOTA system, so the MS2/DTPA
system should result in higher relaxivity rates (26). A
comparison of the length of the resulting linkers could
explain the difference in observed relaxivity rates between
the two cages. The NHS ester used in the DOTA system
results in a short linker with three rotatable bonds. This is
in contrast to the DTPA system, in which the linker is
longer and has four rotatable bonds. (The linker was measured
from the lysine's amine nitrogen to the nitrogen that
coordinates the Gd3+ ion on either chelator.) The longer
and more rotatable linker in the DTPA system may result
in more mobility, and this could account for the higher
ionic relaxivity rates for CCMV-DOTA-Gd compared to
MS2-DTPA-Gd. Finally, Gd3+ loading of up to 360 Gd3+
ions per capsid has been achieved in the CCMV-DOTA-Gd
construct, which yields calculated particle relaxivity
values of R1 = 16,560 and R2 = 51,120 s$^{-1}$ mM$^{-1}$ at 1.5 T.
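
As a rough cross-check of the particle relaxivities just quoted, the Python sketch below divides them by the Gd3+ loading to recover the implied per-ion values. It assumes that particle relaxivity is simply the ionic relaxivity multiplied by the number of Gd3+ ions per capsid; the variable names are illustrative and do not come from the original work.

```python
# Values quoted in the text for the CCMV-DOTA-Gd construct at 1.5 T.
GD_PER_CAPSID = 360
PARTICLE_R1 = 16_560   # s^-1 mM^-1 (per mM of capsid)
PARTICLE_R2 = 51_120   # s^-1 mM^-1 (per mM of capsid)

# Assumption: particle relaxivity = ionic relaxivity x ions per capsid,
# so the per-ion values follow by division.
ionic_r1 = PARTICLE_R1 / GD_PER_CAPSID
ionic_r2 = PARTICLE_R2 / GD_PER_CAPSID

print(f"Implied ionic r1: {ionic_r1:.0f} s^-1 mM^-1 per Gd3+")
print(f"Implied ionic r2: {ionic_r2:.0f} s^-1 mM^-1 per Gd3+")
```

Under this assumption the quoted particle values correspond to ionic relaxivities of 46 and 142 s$^{-1}$ mM$^{-1}$ per Gd3+ ion.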

Viral capsids offer advantages over other macromolecular
contrast agents. Capsids are generally more rigid than
liposome or dendrimer systems. These protein cages are
homogeneous, whereas both dendrimers and liposomes
are heterogeneous mixtures. Also, the ability to accurately
determine which residues are modified, along with the
availability of the near atomic resolution crystal structure,
provides information about the arrangement and environment
of each modification within the viral capsid structure.
This information aids in the design of the agent, since
the spatial arrangement, structural rigidity, and chemical
environment of these modifications can be taken into account.
In conclusion, the work presented here shows the
potential of viral capsids as MRI contrast agents and will
direct the design of the next generation of these imaging
agents.

The Landau-Lifshitz equation reliably describes magnetization dynamics using a phenomenological
treatment of damping. This Letter presents first-principles calculations of the damping parameters for Fe,
Co, and Ni that quantitatively agree with existing ferromagnetic resonance measurements. This agreement
establishes the dominant damping mechanism for these systems and takes a significant step toward
predicting and tailoring the damping constants of new materials.

Magnetic damping determines the performance of magnetic
devices including hard drives, magnetic random access
memories, magnetic logic devices, and magnetic field
sensors. The behavior of these devices can be modeled
using the Landau-Lifshitz (LL) equation [1]

$$\dot{\mathbf{m}} = -|\gamma|\,\mathbf{m} \times \mathbf{H}_{\mathrm{eff}} - \frac{\lambda}{m^{2}}\,\mathbf{m} \times (\mathbf{m} \times \mathbf{H}_{\mathrm{eff}}), \qquad (1)$$

or the essentially equivalent Gilbert (LLG) form [2,3]. The
first term describes precession of the magnetization $\mathbf{m}$
about the effective field $\mathbf{H}_{\mathrm{eff}}$, where $\gamma = g\mu_0\mu_B/\hbar$ is the
gyromagnetic ratio. The second term is a phenomenological
treatment of damping with the adjustable rate $\lambda$. The
Gilbert form replaces this term with $\alpha\,\hat{\mathbf{m}} \times \dot{\mathbf{m}}$ using the
dimensionless damping constant $\alpha = \lambda/\gamma m$. The LL(G)
equation adequately describes dynamics measured by techniques
as varied as ferromagnetic resonance (FMR) [4],
magneto-optical Kerr effect [5], x-ray absorption spectroscopy
[6], and spin-current driven rotation with the addition
of a spin-torque term [7,8].
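
To make the roles of the two terms in Eq. (1) concrete, the following Python sketch integrates the LL equation for a single macrospin in a static field with a fourth-order Runge-Kutta step. It is only an illustration: the reduced units ($|\gamma| = 1$, $|\mathbf{m}| = 1$, $|\mathbf{H}_{\mathrm{eff}}| = 1$), the value $\lambda = 0.05$, and all function names are assumptions of ours, not quantities from the Letter.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def ll_rhs(m, h_eff, gamma, lam):
    """Right-hand side of Eq. (1): precession term plus LL damping term."""
    m_sq = sum(c * c for c in m)
    m_x_h = cross(m, h_eff)
    m_x_m_x_h = cross(m, m_x_h)
    return tuple(-abs(gamma) * p - (lam / m_sq) * d
                 for p, d in zip(m_x_h, m_x_m_x_h))

def rk4_step(m, h_eff, gamma, lam, dt):
    """Advance m by one classical fourth-order Runge-Kutta step."""
    def shifted(k, scale):
        return tuple(mi + scale * ki for mi, ki in zip(m, k))
    k1 = ll_rhs(m, h_eff, gamma, lam)
    k2 = ll_rhs(shifted(k1, dt / 2), h_eff, gamma, lam)
    k3 = ll_rhs(shifted(k2, dt / 2), h_eff, gamma, lam)
    k4 = ll_rhs(shifted(k3, dt), h_eff, gamma, lam)
    return tuple(mi + dt / 6 * (a + 2 * b + 2 * c + d)
                 for mi, a, b, c, d in zip(m, k1, k2, k3, k4))

def integrate(m0, h_eff, gamma, lam, dt, steps):
    """Integrate Eq. (1) for a fixed number of steps in a static field."""
    m = m0
    for _ in range(steps):
        m = rk4_step(m, h_eff, gamma, lam, dt)
    return m

# Hypothetical reduced units: |gamma| = 1, |m| = 1, |H_eff| = 1.
GAMMA, LAM = 1.0, 0.05
alpha = LAM / (GAMMA * 1.0)   # Gilbert constant, alpha = lambda / (gamma m)

# Start perpendicular to the field and integrate to t = 20.
m_final = integrate((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), GAMMA, LAM,
                    dt=0.01, steps=2000)
print("alpha =", alpha)
print("m_z(t = 20) =", m_final[2], "; tanh(lambda * t) =", math.tanh(1.0))
print("|m| =", math.sqrt(sum(c * c for c in m_final)))
```

In these units the component of $\mathbf{m}$ along the field obeys $\dot{m}_z = \lambda(1 - m_z^2)$, so $m_z(t) = \tanh(\lambda t)$ for a start perpendicular to the field. The damping term relaxes $\mathbf{m}$ toward $\mathbf{H}_{\mathrm{eff}}$ while the precession term leaves $m_z$ unchanged, and both terms conserve $|\mathbf{m}|$.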

Access to a range of damping rates in metallic materials
is desirable when constructing devices for different applications.
Empirically, doping NiFe alloys with transition
metals [9] or rare earths [10] has produced compounds
with damping rates in the range of $\alpha = 0.01$ to 0.8. A
recent investigation of adding vanadium to iron resulted
in an alloy with a decreased damping rate [11]. Unfortunately,
the damping rate of a new material cannot be
predicted because there has not yet been a first-principles
calculation of damping that quantitatively agrees with
experiment. The challenging pursuit of new materials
with specific or lowered damping rates is further complicated
by the expectation that, as device size continues to be
scaled down, material parameters, such as $\lambda$, should
change [12]. A detailed understanding of the important
damping mechanisms in metallic ferromagnets and the
ability to predictively calculate damping rates would
greatly facilitate the design of new materials appropriate
for a variety of applications.


The temperature dependence of damping in the transition
metals has been carefully characterized through measurement
of small angle dynamics by FMR [13,14]. While
one might naively expect damping to increase monotonically
with temperature, as it does for Fe, both Co and Ni
also exhibit a dramatic rise in damping at low temperature
as the temperature decreases. These observations indicate
that two primary mechanisms are involved. Subsequent
experiments [15,16] partition these nonmonotonic damping
curves into a conductivitylike term that decreases with
temperature like the conductivity and a resistivitylike term
that increases with temperature like the resistivity. The two
terms were found to give nearly equal weight to the damping
curve of Ni.

A number of mechanisms for damping in these systems
have been proposed [14,17-24]. See the review by
Heinrich [25] for a more complete discussion. However,
none of the models has been shown to quantitatively
agree with measured values. The torque-correlation model
of Kamberský [17] qualitatively matches the data, but has
not been quantitatively evaluated in a rigorous fashion.
Here, we report first-principles calculations of the
Landau-Lifshitz damping constant according to
Kamberský's torque-correlation expression. Quantitative
comparison of the present calculations to the measured
FMR values [13] positively identifies this damping pathway
as the dominant effect in the transition metal systems.
In addition to presenting these primary conclusions, we
also describe the relationship between the torque-correlation
model and the more widely understood breathing
Fermi surface model [18,21], showing that the results
of both models agree quantitatively in the low scattering
rate limit.

The breathing Fermi surface model of Kamberský predicts

$$\lambda = \frac{g^{2}\mu_B^{2}}{\hbar} \sum_{n} \int \frac{dk^{3}}{(2\pi)^{3}}\, \eta(\epsilon_{n,k}) \left(\frac{\partial \epsilon_{n,k}}{\partial \theta}\right)^{2} \frac{\tau}{\hbar}. \qquad (2)$$

This model offers a qualitative explanation for the low
temperature conductivitylike contribution to the measured
damping. The model describes damping of uniform precession
as due to variations $\partial\epsilon_{n,k}/\partial\theta$ in the energies $\epsilon_{n,k}$ of
the single-particle states with respect to the spin direction
$\theta$. The states are labeled with a wave vector $k$ and band
index $n$. As the magnetization precesses, the spin-orbit
interaction changes the energy of the electronic states,
pushing some occupied states above the Fermi level and
some unoccupied states below the Fermi level. Thus,
electron-hole pairs are generated near the Fermi level
even in the absence of changes in the electronic populations.
The $\eta$ function in Eq. (2) is the negative derivative of
the Fermi function and picks out only states near the Fermi
level to contribute to the damping; $g$ is the Landé $g$ factor
and $\mu_B$ is the Bohr magneton. The electron-hole pairs
created by the precession exist for some lifetime $\tau$ before
relaxing through lattice scattering. The amount of energy
and angular momentum dissipated to the lattice depends on
how far from equilibrium the system gets; thus, damping
by this mechanism increases linearly with the electron
lifetime, as seen in Eq. (2). Since the electron lifetime is
expected to decrease as the temperature increases, this
model predicts that damping diminishes as the temperature
is raised.

Because the predicted damping rate is linear in the
scattering time, the damping rate cannot be calculated
more accurately than the scattering time is known. For
this reason it is not possible to make quantitative comparisons
between calculations of the breathing Fermi surface
and measurements. Further, while the breathing Fermi
surface model can explain the dramatic temperature dependence
observed in the conductivitylike portion of the
data, it fails to capture the physics driving the resistivitylike
term. This is a significant limitation from a practical perspective
because the resistivitylike term dominates damping
at room temperature and above and is the only
contribution observed in iron [13] and NiFe alloys [26].

Kamberský's torque-correlation model predicts

$$\lambda = \frac{g^{2}\mu_B^{2}}{\hbar} \sum_{n,m} \int \frac{dk^{3}}{(2\pi)^{3}}\, |\Gamma^{-}_{nm}(k)|^{2}\, W_{nm}(k), \qquad (3)$$

and we will show that it both incorporates the physics of
the breathing Fermi surface model and also accounts for
the resistivitylike terms. The matrix elements $\Gamma^{-}_{nm}(k) =
\langle n, k|[\sigma^{-}, H_{so}]|m, k\rangle$ measure transitions between states
in bands $n$ and $m$ induced by the spin-orbit torque. These
transitions conserve wave vector $k$ because they describe
the annihilation of a uniform precession magnon, which
carries no linear momentum. The nature of these scattering
events, which are weighted by the spectral overlap
$W_{nm}(k) = (1/\pi) \int d\omega_1\, \eta(\omega_1) A_{nk}(\omega_1) A_{mk}(\omega_1)$, will be
discussed in more detail below. The electron spectral functions
$A_{nk}$ are Lorentzians centered around the band energies
$\epsilon_{nk}$ and broadened by interactions with the lattice. The
width of the spectral function $\hbar/\tau$ provides a phenomenological
account for the role of electron-lattice scattering in
the damping process. The $\eta$ function is the same as in
Eq. (2) and enforces the requirement of spectral overlap at
the Fermi level.

Equation (3) captures two different types of scattering
events: scattering within a single band, $m = n$, for which
the initial and final states are the same, and scattering
between two different bands, $m \neq n$. As explained in
[17], the overlap of the spectral functions is proportional
to the electron scattering time for intraband scattering and
inversely proportional to it for interband scattering. From
this observation the qualitative
conclusion is made that the intraband contributions match
the conductivitylike terms while the interband contributions
give the resistivitylike terms. Evaluation of Eq. (3) is
more computationally intensive than that of the breathing
Fermi surface model, and until now only a few estimates for
Ni and Fe have been made [19].
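
The opposite scaling of the intraband and interband overlaps can be illustrated numerically. The Python sketch below integrates the product of two Lorentzian spectral functions over the band energy, which is a simplified stand-in for the overlap $W_{nm}$: it assumes $\eta$ acts as a delta function at the Fermi level and replaces the $k$ integral by an energy integral at constant density of states with a fixed band separation. The half-width plays the role of $\hbar/2\tau$; all names and numerical values are hypothetical.

```python
import math

def lorentzian(energy, center, half_width):
    """Normalized Lorentzian spectral function A(energy)."""
    return (half_width / math.pi) / ((energy - center) ** 2 + half_width ** 2)

def overlap(gap, half_width, span=20.0, steps=100_000):
    """Trapezoid-rule integral of A_n * A_m for bands separated by `gap`."""
    lo, hi = -span, span + gap
    de = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        e = lo + i * de
        weight = 0.5 if i in (0, steps) else 1.0
        total += (weight * lorentzian(e, 0.0, half_width)
                  * lorentzian(e, gap, half_width))
    return total * de

def overlap_closed_form(gap, half_width):
    """Two equal Lorentzians overlap as a Lorentzian of twice the width."""
    return (2 * half_width / math.pi) / (gap ** 2 + (2 * half_width) ** 2)

if __name__ == "__main__":
    # Halving the half-width corresponds to doubling the scattering time.
    for label, gap in (("intraband (gap = 0)", 0.0), ("interband (gap = 1)", 1.0)):
        wide = overlap(gap, 0.04)
        narrow = overlap(gap, 0.02)
        print(f"{label}: overlap ratio narrow/wide = {narrow / wide:.3f}")
```

For zero gap the overlap grows as the bands sharpen (proportional to $\tau$), while for a gap much larger than the broadening it shrinks (proportional to $1/\tau$), which is the conductivitylike versus resistivitylike behavior described above.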

We have performed first-principles calculations of the
torque-correlation model Eq. (3) with realistic band structures
for Fe, Co, and Ni. Prior to evaluating Eq. (3) the
eigenstates and energies of each metal were found using
the linear augmented plane wave method [27] in the local
spin density approximation [28-30]. Details of the calculations
for these materials are described in [31]. The exchange
field was fixed in the chosen equilibrium
magnetization direction. Calculations of Eq. (3) presented
in this Letter are converged to within a standard deviation
of 3%, which required sampling $(160)^3$ $k$ points for Fe,
$(120)^3$ for Ni, and $(100)^2$ $k$ points in the basal plane by 57
along the $c$ axis for Co. Electron-lattice interactions were
treated phenomenologically as a broadening of the spectral
functions. The Fermi distribution was smeared with an
artificial temperature. Results did not vary significantly
with reasonable choices of this temperature since the
broadening of the Fermi distribution was considerably
less than that of the bands. The damping rate was calculated
for a range of scattering rates (spectral widths) just as
damping has been measured over a range of temperatures.

The results of these calculations are presented in Fig. 1
and are decomposed into the intraband and interband
terms. The downward sloping line in Fig. 1 represents
the intraband contribution to damping. Damping constants
were recently calculated using the breathing Fermi surface
model [12,21] by evaluating the derivative of the electronic
energy with respect to the spin direction according to
Eq. (2). The results of the breathing Fermi surface prediction
are indistinguishable from the intraband terms of the
present calculation even though the computational approaches
differed significantly; the agreement is quantified
in Table I.

The breathing Fermi surface model could not be quantitatively
compared to the experimental results because the
temperature dependence of the scattering rate has not been
determined sufficiently accurately. While the present calculations
also require knowledge of the scattering rate to
determine the damping rate, the nonmonotonic dependence
of damping on the scattering rate produces a unique
minimum damping rate. In the same manner that the
calculated curves of Fig. 1 have a minimum with respect
to scattering rate, the measured damping curves exhibit
minima with respect to temperature. Whatever the relation
between temperature and scattering rate, the calculated
minima may be compared directly and quantitatively to
the measured minima. Table I makes this comparison. The
agreement between measured and calculated values shows
that the torque-correlation model accounts for the dominant
contribution to damping in these systems.

FIG. 1. Calculated Landau-Lifshitz damping constant for Fe,
Co, and Ni. Thick solid curves give the total damping parameter
while dotted curves give the intraband and dashed lines the
interband contributions. Values for $\lambda$ are given in SI units. The
right axis is the equivalent Gilbert damping parameter and the
top axis is the full width half maximum of the electron spectral
functions.
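
The reason the minima can be compared without knowing how the scattering rate depends on temperature can be seen in a toy model. The Python sketch below assumes a damping curve made of an intraband part proportional to $1/\text{rate}$ and an interband part proportional to the rate, and evaluates it along two different, entirely hypothetical, monotonic mappings from temperature to scattering rate. The coefficients and mappings are illustrative choices of ours, not values from the calculations.

```python
def toy_damping(rate, intra=1.0, inter=1.0):
    """Toy damping: intraband part ~ 1/rate plus interband part ~ rate."""
    return intra / rate + inter * rate

def minimum_over_temperature(rate_of_temperature, temperatures):
    """Smallest damping seen while sweeping temperature."""
    return min(toy_damping(rate_of_temperature(t)) for t in temperatures)

def linear_rate(t):
    """Hypothetical mapping: scattering rate linear in temperature."""
    return 0.01 * t

def quadratic_rate(t):
    """A different hypothetical mapping: rate quadratic in temperature."""
    return 1e-5 * t * t

# Temperature sweep in arbitrary units, 0.1 ... 1000.
temperatures = [0.1 * i for i in range(1, 10_001)]

min_linear = minimum_over_temperature(linear_rate, temperatures)
min_quadratic = minimum_over_temperature(quadratic_rate, temperatures)
print("minimum damping, linear rate(T):   ", min_linear)
print("minimum damping, quadratic rate(T):", min_quadratic)
```

Both sweeps reach the same minimum value, $2\sqrt{\text{intra} \cdot \text{inter}}$ in this toy model, although the minimum occurs at different temperatures. Only the value of the minimum, not its position, is independent of the temperature-to-rate relation.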

Our calculated values are smaller than the measured
values. Using measured $g$ values instead of setting $g = 2$
would increase our results by a factor of $(g/2)^2$, or about
10% for Fe and 20% for Co and Ni. Other possible reasons
for the difference include a simplified treatment of
electron-lattice scattering in which the scattering rates for
all states were assumed equal, errors associated with using
wave functions found from the mean-field local spin density
approximation, and numerical convergence (discussed
above). Additionally, the extraction of damping rates from
the measured linewidths remains challenging for Fe and
particularly for Co. Other damping mechanisms may also
make small contributions [22-24].
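
The size of the $g$-factor correction can be worked through with the Table I minima. The Python sketch below recomputes the ratios $\lambda_{calc}/\lambda_{meas}$, applies the approximate 10% and 20% increases quoted above, and backs out the $g$ value that each increase implies through $(g/2)^2$. The function and key names are ours, and the implied $g$ values are derived from the quoted percentages rather than taken from measurements.

```python
import math

# (lambda_calc, lambda_meas) in 1e9 s^-1 from Table I, followed by the
# approximate fractional increase quoted in the text for using measured g.
ROWS = {
    "bcc Fe (001)": (0.54, 0.88, 0.10),
    "hcp Co (0001)": (0.37, 0.90, 0.20),
    "fcc Ni (111)": (2.1, 2.9, 0.20),
}

def summarize(rows):
    """Return the raw ratio, the g-corrected ratio, and the implied g."""
    out = {}
    for name, (calc, meas, boost) in rows.items():
        ratio = calc / meas
        out[name] = {
            "ratio": ratio,
            "ratio_with_g": ratio * (1.0 + boost),
            # (g/2)^2 = 1 + boost  =>  g = 2 * sqrt(1 + boost)
            "implied_g": 2.0 * math.sqrt(1.0 + boost),
        }
    return out

summary = summarize(ROWS)
for name, row in summary.items():
    print(f"{name}: calc/meas = {row['ratio']:.2f}, "
          f"with g correction = {row['ratio_with_g']:.2f}, "
          f"implied g = {row['implied_g']:.2f}")
```

The uncorrected ratios reproduce the 0.61, 0.41, and 0.72 listed in Table I. The correction narrows the gap to the measured values but does not close it, consistent with the other sources of difference listed above.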

Since the manipulations involved with the equation of
motion techniques employed in deriving Eq. (3) obscure
the underlying physics, we now discuss the two scattering
processes and connect the intraband terms to the breathing
Fermi surface model. The intraband terms in Eq. (3) describe
scattering from one state to itself by the torque
operator, which is similar to a spin-flip operator. A spin-flip
operation between some state and itself is only nonzero
because the spin-orbit interaction mixes small amounts of
the opposite spin direction into each state. Since the initial
and final states are the same, the operation is naturally spin
conserving. The matrix elements do not describe a real
transition, but rather provide a measure of the energy of the
electron-hole pairs that are generated as the spin direction
changes. The electron-hole pairs are subsequently annihilated
by a real electron-lattice scattering event.

TABLE I. Calculated and measured [13] damping parameters. Values for $\lambda$ are reported in $10^{9}$ s$^{-1}$ while those for $\alpha$ are
dimensionless. Values in the first four columns indicate minima of the calculated or measured curves. The last two columns list
calculated damping due to the intraband contribution from Eq. (3) and from the breathing Fermi surface model [12], respectively.
Values for $\lambda/\tau$ are given in $10^{22}$ s$^{-2}$. Published numbers from [12,13] have been multiplied by $4\pi$ to convert from the cgs unit system
to SI.

                 alpha_calc  lambda_calc  lambda_meas  lambda_calc/lambda_meas  (lambda/tau)_intra  (lambda/tau)_bfs
bcc Fe (001)     0.0013      0.54         0.88         0.61                     1.01                0.968
bcc Fe (111)     0.0013      0.54                                               1.35                1.29
hcp Co (0001)    0.0011      0.37         0.9          0.41                     0.786               0.704
fcc Ni (111)     0.017       2.1          2.9          0.72                     6.67                6.66
fcc Ni (001)     0.018       2.2                                                8.61                8.42

To connect the derivatives $\partial\epsilon/\partial\theta$ in Eq. (2) and the
torque matrix elements in Eq. (3), we imagine first pointing
the magnetization in some direction $\hat{z}$. The only energy that
changes with the magnetization direction is the spin-orbit
energy $H_{so}$. As the spin of a single-particle state $|\,\rangle$ rotates
along $\theta$ about $\hat{x}$, its spin-orbit energy is given by $\epsilon(\theta) =
\langle\,|e^{i\sigma_x\theta} H_{so} e^{-i\sigma_x\theta}|\,\rangle$. The derivative with respect to $\theta$ is
$\partial\epsilon(\theta)/\partial\theta = i\langle\,|e^{i\sigma_x\theta}[\sigma_x, H_{so}]e^{-i\sigma_x\theta}|\,\rangle$. Evaluating this derivative
at the pole ($\theta = 0$) gives $\partial\epsilon/\partial\theta = i\langle\,|[\sigma_x, H_{so}]|\,\rangle$.
Similarly, rotating the spin along $\theta$ about $\hat{y}$ leads to
$\partial\epsilon/\partial\theta = i\langle\,|[\sigma_y, H_{so}]|\,\rangle$. The torque matrix elements in
Eq. (3) are $\Gamma^{-} = \langle\,|[\sigma^{-}, H_{so}]|\,\rangle = \langle\,|[\sigma_x, H_{so}]|\,\rangle -
i\langle\,|[\sigma_y, H_{so}]|\,\rangle$. Using the relations between the commutators
and derivatives just found, the torque is $\Gamma^{-} =
-i(\partial\epsilon/\partial\theta)_x - (\partial\epsilon/\partial\theta)_y$, where the subscripts indicate
the rotation axis. Squaring the torque matrix elements
gives $|\Gamma^{-}|^2 = (\partial\epsilon/\partial\theta)_x^2 + (\partial\epsilon/\partial\theta)_y^2$. For high symmetry
directions $(\partial\epsilon/\partial\theta)_x = (\partial\epsilon/\partial\theta)_y$ and we deduce $|\Gamma^{-}|^2 =
2(\partial\epsilon/\partial\theta)^2$, demonstrating that the intraband terms of the
torque-correlation model describe the same physics as the
breathing Fermi surface.
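
The relation between the commutators and the energy derivatives can be verified numerically for a single spin. The Python sketch below uses a hypothetical 2x2 Hermitian matrix as a stand-in for $H_{so}$ and an arbitrary normalized spinor (neither is taken from the calculations), differentiates $\epsilon(\theta)$ by central differences, and compares the result with $i\langle[\sigma, H_{so}]\rangle$ and with $|\Gamma^{-}|^2$, taking $\sigma^{-} = \sigma_x - i\sigma_y$ as in the text.

```python
import math

# Pauli matrices as nested lists of complex numbers.
SX = [[0j, 1 + 0j], [1 + 0j, 0j]]
SY = [[0j, -1j], [1j, 0j]]

def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

def expectation(psi, op):
    """<psi|op|psi> for a two-component spinor."""
    return sum(psi[i].conjugate() * op[i][j] * psi[j]
               for i in range(2) for j in range(2))

def rotation(sigma, theta):
    """exp(i sigma theta) = cos(theta) 1 + i sin(theta) sigma."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c * (i == j) + 1j * s * sigma[i][j] for j in range(2)]
            for i in range(2)]

def energy(theta, sigma, h_so, psi):
    """epsilon(theta) = <psi| e^{i sigma theta} H e^{-i sigma theta} |psi>."""
    u = rotation(sigma, theta)
    return expectation(psi, matmul(matmul(u, h_so), dagger(u))).real

def derivative(sigma, h_so, psi, step=1e-5):
    """Central-difference estimate of d(epsilon)/d(theta) at theta = 0."""
    return (energy(step, sigma, h_so, psi)
            - energy(-step, sigma, h_so, psi)) / (2 * step)

# Hypothetical Hermitian stand-in for H_so and a normalized test spinor.
H_SO = [[0.3 + 0j, 0.2 - 0.5j], [0.2 + 0.5j, -0.3 + 0j]]
PSI = [0.6 + 0j, 0.8j]

d_x = derivative(SX, H_SO, PSI)
d_y = derivative(SY, H_SO, PSI)
comm_x = expectation(PSI, commutator(SX, H_SO))
comm_y = expectation(PSI, commutator(SY, H_SO))
gamma_minus = comm_x - 1j * comm_y

print("d(eps)/d(theta) about x:", d_x, " i<[sx, H]> =", (1j * comm_x).real)
print("d(eps)/d(theta) about y:", d_y, " i<[sy, H]> =", (1j * comm_y).real)
print("|Gamma^-|^2 =", abs(gamma_minus) ** 2, " sum of squares =", d_x ** 2 + d_y ** 2)
```

Because the commutator of two Hermitian operators is anti-Hermitian, both expectation values are purely imaginary, so the two derivatives are real and $|\Gamma^{-}|^2$ reduces to the sum of their squares.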

The monotonically increasing curves in Fig. 1 indicate
the interband contribution to damping. Uniform mode
magnons, which have negligible energy, may induce quasielastic
transitions between states with different energies.
This occurs when lattice scattering broadens bands sufficiently
so that they overlap at the Fermi level. These wave
vector conserving transitions, which are driven by the
precessing exchange field, occur primarily between states
with significantly different spin character. The process may
roughly be thought of as the decay of a uniform precession
magnon into a single electron spin-flip excitation. These
events occur more frequently as the band overlaps increase.
For this reason the interband terms, which qualitatively
match the resistivitylike contributions in the experimental
data, dominate damping at room temperature and above.

We have calculated the Landau-Lifshitz damping parameter
for the itinerant ferromagnets Fe, Co, and Ni as a
function of the electron-lattice scattering rate. The intraband
and interband components match qualitatively the
conductivitylike and resistivitylike terms observed in FMR
measurements. A quantitative comparison was made between
the minimal damping rates calculated as a function
of scattering rate and measured with respect to temperature.
This comparison demonstrates that our calculations
account for the dominant contribution to damping in these
systems and identify the primary damping mechanism. At
room temperature and above, damping occurs overwhelmingly
through the interband transitions. The contribution of
these terms depends in part on the band gap spectrum
around the Fermi level, which could be adjusted through
doping.

A monolayer of live bacterial cells has been patterned onto substrates through the interaction between CFA/I
fimbriae and the corresponding antibody. Using Salmonella enterica serovar Typhimurium as a model, patterns of live
bacteria have been prepared with cellular resolution on silicon and gold substrates with high specificity and efficiency.
The immobilized cells are capable of dividing in growth medium to form a self-sustaining bacterial monolayer on
the patterned areas. Interestingly, the immobilized cells can alter their orientation on the substrate, from lying down
to standing up, in response to the increase in cell density during incubation. This method was successfully used to
sort a targeted bacterial species from a mixed culture within 2 h.


Introduction 

Bacterial cells are ideal sensors for environmental monitoring
because of their low cost, fast growth, rich genetic modifications,
easy handling, and sensitivity to a wide variety of environmental
stimuli.1 Efficient, controllable immobilization of bacteria is
critical for the success of such biosensors. Such immobilization
also offers potential applications in biomedical research and
fundamental bacteriological studies such as quorum sensing. The
majority of reported immobilization approaches utilize either
nonspecific adsorption of bacterial cells on chemically treated
surfaces or physical entrapment of cells in gels or microholes.
For example, attachment of bacteria has been conducted on
prefabricated microarrays with microholes treated either with
poly-L-lysine (PLL)2 or with n-hexadecanethiol,3 while the areas
surrounding these holes were coated with a layer of poly(ethylene
glycol) (PEG). Micro-4 and macrocontact5 printing have been
employed to transfer live bacteria onto the surface of a nutrient-containing
matrix such as agarose or hydrogel. Bacterial
microarrays have also been prepared by loading individual
bacterial cells into microwells (2.5 μm wide, μm deep) at
the distal end of an optical fiber bundle by centrifugation.6 A
bacterial array printed onto porous nylon has also been reported,7
in which cells were physically entrapped in the pores of a special
nylon substrate in close contact with a nutrient medium. Such
arrays offer great potential for monitoring genotoxicity8 and heavy
metals in the environment9 and for high throughput assays of
gene expression.10 Very recently, Akselrod et al. reported three-dimensional
heterotypic arrays of living cells in hydrogels created
by means of high-precision (submicrometer accuracy) time-multiplexed
holographic laser trapping.11 However, this technique
has limited applications in practice: besides the need for
a trapping laser, excessive exposure to laser light may cause
photodamage to the cells; furthermore, arrays are expected to
merge in a few hours because of cell division.

Another approach to bacterial immobilization takes advantage
of the interaction between an appropriate antibody-antigen pair.
This approach has been used in conjunction with microcontact
printing and dip-pen nanolithography;12 binding of Escherichia
coli to microscale features was achieved through antibodies
against the whole cell or bacterial flagella. However, E. coli cells
showed a lower attachment to features modified with antibodies
than with PLL.12 The poor immobilization of bacterial cells
mediated by antibody binding is also evidenced by a report on
environmental toxicity monitoring using immobilized E. coli in
which only 2% surface coverage of the bacteria was achieved.13
In other studies, antibody-modified substrates have been used
for immobilizing and detecting pathogenic bacteria, but little has
been reported on the cell density on the substrates.14 The present
study, as well as previous studies, indicates that antibody-antigen-based
immobilization does not hinder such physiological activities
as cell division (this work), gene expression, or bioluminescence
of bacteria at the locations of their immobilization.12,13

In  most  of  these  applications,  limited  attention  has  been  paid 
to  the  efficiency  and  control  of  the  immobilization  of  live  cells 
on  substrates  or  to  the  physiological  activity  of  individually 
immobilized  bacteria.  For  example,  patches  of  bacteria  created 
by  microcontact  printing  will  either  grow  in  lateral  directions  or 
disintegrate  in  a  short  period  of  time  if  exposed  to  a  flow  reactor, 
resulting  in  a  loss  of  functionality  of  the  sensor.  An  efficient, 
reproducible, stable, self-sustaining, and highly specific immobilization
of bacteria on a predefined surface is necessary. In
this  paper,  we  report  such  an  immobilization  of  live  cells  of  S. 

(10) Van Dyk, T. K.; DeRose, E. J.; Gonye, G. E. J. Bacteriol. 2001, 183 (19),
5496-5505.

(11) Akselrod, G. M.; Timp, W.; Mirsaidov, U.; Zhao, Q.; Li, C.; Timp, R.;
Timp, K.; Matsudaira, P.; Timp, G. Biophys. J. 2006, 91 (9), 3465-3473.

(12) Rozhok, S.; Shen, C. K. F.; Littler, P. L. H.; Fan, Z. F.; Liu, C.; Mirkin,
C. A.; Holz, R. C. Small 2005, 1 (4), 445-451.

(13) Premkumar, J. R.; Lev, O.; Marks, R. S.; Polyak, B.; Rosen, R.; Belkin,
S. Talanta 2001, 55 (5), 1029-1038.

(14) Oh, B. K.; Kim, Y. K.; Park, K. W.; Lee, W. H.; Choi, J. W. Biosens.
Bioelectron. 2004, 19 (11), 1497-1504.


10.1021/la7038653 CCC: $40.75 © 2008 American Chemical Society

Published on Web 03/06/2008

4162 Langmuir, Vol. 24, No. 8, 2008

Suo et al.


Typhimurium  on  well-characterized  material  surfaces;  we  chose 
this  species  because  of  its  zoonotic  properties,  infecting  both 
animals and humans, and our desire to prevent such infections.

Experimental  Section 

Bacteria. In most experiments, Salmonella enterica serovar
Typhimurium Δasd::kanR H71-pHC was used as a model bacterial
species for immobilization and patterning. We constructed a Δasd::kanR
lethal mutant from wild-type S. Typhimurium H71, termed
Δasd::kanR S. Typhimurium H71.15 This lethal mutant cannot
normally survive unless an asd gene is presented in trans. Plasmid
pHC, which contains a chimeric triple promoter, PtetA-PpagC-PphoP,16,17
was used to express CFA/I fimbriae. The chimeric triple promoter
was installed upstream of the cfa/I operon to enhance cfa/I expression.
For the sorting experiment (Figure 6), S. Typhimurium Δasd::kanR
H71-(pHC+pGFP) and E. coli O157:H7 RFP were used. Plasmids
pHC and pGFP (Clontech, Mountain View, CA) were used to express
CFA/I fimbriae and green fluorescent protein (GFP), respectively.
E. coli O157:H7 RFP, expressing red fluorescent protein (RFP),18
was obtained from Dr. T. Khan and Dr. B. Klayman at the Center
for Biofilm Engineering, Montana State University.

Frozen bacteria stock at -80 °C was inoculated onto a Luria-Bertani
(LB) plate and incubated at 37 °C overnight. The bacteria
were then inoculated into an LB liquid medium without antibiotics
and shaken at 125 rpm at 37 °C. The bacterial cells were harvested
when the optical density of the medium at 600 nm (OD600) reached
about 0.5-0.6, which corresponds to a colony forming unit (CFU)
value of ~9.0 × 10^8/mL.

Antibody.  The  anti-CFA/I  serum  was  prepared  by  immunizing 
a  rabbit  intramuscularly  (im)  with  purified  CFA/I  fimbriae  proteins. 
Four  weeks  post  immunization,  the  rabbit  was  bled  to  check  the 
serum anti-CFA/I titers using an enzyme-linked immunosorbent
assay (ELISA). Serum IgG was further purified with a protein G
column to remove nonspecific serum proteins. This antibody was
diluted 100-fold with phosphate-buffered saline (PBS) (pH =
7.4)  before  use. 

Chemicals. PBS buffer salt, 3-aminopropyltriethoxysilane
(APTES), and 11-mercaptoundecanoic acid (11-MUDA) were
purchased from Sigma-Aldrich (St. Louis, MO). N-[β-Maleimidopropyloxy]-succinimide
ester (BMPS), 1-ethyl-3-[3-dimethylaminopropyl]carbodiimide
hydrochloride (EDC), and N-hydroxysuccinimide
(NHS) were purchased from Pierce Biotechnology
(Rockford, IL). Bacterial viability dyes (Live/Dead BacLight) were
purchased from Invitrogen (Carlsbad, CA). HS(CH2)11(OCH2CH2)3OH
(SPT-11) was purchased from Sensopath Technology, Inc.
(Bozeman, MT), and 2-[methoxy(polyethyleneoxy)propyl]trimethoxysilane
(PEG-silane) was purchased from Gelest, Inc.
(Morrisville, PA).

Substrate Passivation. Gold. Gold-coated silicon chips were
cleaned in a chloroform bath and ozone plasma chamber, rinsed
with 100% ethanol, and then incubated in a 2 mM solution of
SPT-11 (ref 19) in 100% ethanol overnight. After being rinsed with copious (5
× 1 mL) ethanol and water, the modified chips were dried with
nitrogen.

Silicon.  Silicon  chips  cleaned  by  ozone  plasma  were  heated  in  a 
solution  of  PEG-silane20  (2%,  v/v)  in  isopropanol  at  60  °C  for  70 
min, rinsed with copious (5 × 1 mL) 100% ethanol, and dried with
nitrogen. 
Patterning Substrates Using a Microfocused Ga Ion Beam.
The substrate was patterned using the Ga+ beam of a time-of-flight
secondary ion mass spectrometry (ToF-SIMS) system (TRIFT I, PHI-Evans,
Chanhassen, MN).21 The microfocused Ga+ beam provides
a 1.3 nA DC current at ~15 keV beam energy. The sample potential
for positive ions was kept at ~3 kV; hence, the primary ion impact
energy was ~12 keV. The direct Ga+ ion current at the target position
was measured to be ~1.26 nA. The etching time was carefully
adjusted so that only a very thin layer of the substrate surface was
removed (<5 nm). To the best of our knowledge this is the first
report on the preparation of bacterial cell patterns using a focused
ion beam.
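
The beam figures quoted above can be checked with a short calculation. The sketch below is illustrative only: it assumes singly charged Ga+ ions, and the function names are hypothetical. The dwell time and etched area used in the fluence example are placeholder values, since neither is reported here.

```python
# Elementary charge in coulombs (exact SI value).
E_CHARGE_C = 1.602176634e-19

def impact_energy_kev(beam_energy_kev, sample_potential_kv, charge_state=1):
    """Impact energy of a positive ion decelerated by a positive sample bias."""
    return beam_energy_kev - charge_state * sample_potential_kv

def ions_per_second(current_a, charge_state=1):
    """Ion arrival rate corresponding to a measured beam current."""
    return current_a / (charge_state * E_CHARGE_C)

def fluence_ions_per_cm2(current_a, dwell_s, area_cm2, charge_state=1):
    """Ion dose per unit area for a given dwell time over a given area."""
    return ions_per_second(current_a, charge_state) * dwell_s / area_cm2

# Values taken from the text: 15 keV beam, 3 kV sample potential, 1.26 nA.
energy_kev = impact_energy_kev(15.0, 3.0)
rate = ions_per_second(1.26e-9)

# ASSUMPTION: 1 s dwell over a 100 um x 100 um square (1e-4 cm^2),
# chosen only to show the arithmetic; not values from the paper.
example_fluence = fluence_ions_per_cm2(1.26e-9, dwell_s=1.0, area_cm2=1e-4)

print(f"impact energy: {energy_kev:.1f} keV")
print(f"ion rate: {rate:.3e} ions/s")
print(f"example fluence: {example_fluence:.3e} ions/cm^2")
```

With the stated values, the impact energy comes out at 12 keV, matching the text, and the 1.26 nA target current corresponds to roughly 8 × 10^9 ions per second.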

Covalent Linking of Antibody. Gold. Etched gold substrates
were incubated in a 2 mM solution of 11-MUDA in 100% ethanol
overnight. After being rinsed with copious (5 × 1 mL) ethanol and
water, the modified chips were treated with an NHS/EDC solution
(NHS, 3 mg/mL; EDC, 2 mg/mL) for 1 h at ambient temperature
and further rinsed with PBS buffer.
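
For comparison with the millimolar concentrations used elsewhere in this section, the NHS/EDC mass concentrations can be converted to molar units. This is a minimal sketch that assumes standard molar masses (NHS, 115.09 g/mol; EDC as its hydrochloride salt, 191.70 g/mol); the helper name is hypothetical.

```python
# Standard molar masses in g/mol. EDC is taken as the hydrochloride salt,
# as listed under Chemicals.
MOLAR_MASS_G_PER_MOL = {"NHS": 115.09, "EDC_HCl": 191.70}

def mg_per_ml_to_mM(conc_mg_per_ml, molar_mass_g_per_mol):
    """Convert mg/mL (numerically equal to g/L) to mmol/L."""
    return conc_mg_per_ml / molar_mass_g_per_mol * 1000.0

nhs_mM = mg_per_ml_to_mM(3.0, MOLAR_MASS_G_PER_MOL["NHS"])
edc_mM = mg_per_ml_to_mM(2.0, MOLAR_MASS_G_PER_MOL["EDC_HCl"])

print(f"NHS: {nhs_mM:.1f} mM, EDC: {edc_mM:.1f} mM")
print(f"NHS:EDC molar ratio = {nhs_mM / edc_mM:.1f}:1")
```

Under these assumptions the activation solution is about 26 mM NHS and 10 mM EDC, an NHS:EDC molar ratio of about 2.5:1.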

Silicon.  Etched  silicon  substrates  were  incubated  in  a  solution  of 
APTES in methanol (2%) for 10 min and rinsed with copious (5 ×
1  mL)  ethanol  before  being  further  incubated  in  a  solution  of  BMPS 
in  anhydrous  acetonitrile  (10  mM)  for  30  min  at  room  temperature. 
The  chips  were  rinsed  with  acetonitrile  and  dried  in  air. 

Antibody Linking. The activated substrates were incubated with
antibody  solution  for  1  h  at  room  temperature  and  rinsed  with  PBS 
buffer  to  remove  the  free  antibody  molecules.  These  antibody- 
modified  chips  were  then  used  for  bacterial  immobilization. 

Immobilization  of  Bacterial  Cells.  Substrates  with  antibody 
patterns  were  incubated  with  a  suspension  of  live  bacterial  cells  in 
half LB growth medium (LB medium diluted with 1% NaCl solution
by  a  ratio  of  1:1,  v/v)  at  room  temperature  for  3  h.  Some  samples 
were  also  incubated  at  37  °C  for  3  h  and  showed  no  obvious  difference. 
After  incubation,  the  samples  were  gently  rinsed  with  copious  PBS 
buffer  (>5  mL)  to  remove  the  planktonic  or  loosely  attached  cells. 
The  rinsed  samples  were  kept  in  PBS  buffer  or  growth  medium  at 
4  °C  for  further  analysis. 

Optical  Imaging  of  Immobilized  Cells.  Optical  imaging  was 
done using either an Olympus BX61 or a Leica TCS SP2 microscope.
All of the images were recorded in reflection mode (Figures 2, 4,
and 5A,B) or in fluorescence mode (Figures 5C,D and 6). For
reflection mode imaging, the samples were imaged using water
immersion objective lenses in PBS buffer or 1/2 LB growth medium
without staining the bacterial cells. The bright background color of
the patterned areas in Figures 2, 4, and 5A,B is due to the Ga+
focused ion beam etching of the substrate surfaces. For fluorescence
mode imaging, the cells were stained using Live/Dead BacLight
according  to  the  protocol  suggested  by  the  dye  manufacturer,  and 
imaged  in  a  PBS  buffer  using  water  immersion  objective  lenses. 

X-ray  Photoelectron  Spectroscopy  (XPS).  XPS  analysis  was 
conducted  using  a  Physical  Electronics  5600ci  system  equipped 
with monochromatized Al Kα X-rays. Data acquisition and data
analysis  were  performed  using  RBD  AugerScan2  software. 

Atomic  Force  Microscopy  (AFM).  Images  were  obtained  in 
tapping  mode  in  air  using  a  Multimode  V  atomic  force  microscope 
from  Veeco  (Santa  Barbara,  CA)  with  an  E-  or  J-type  scanner  and 
using NSC18 AFM probes from MikroMasch (Wilsonville, OR).

Field  Emission  Scanning  Electron  Microscopy  (FESEM). 
Immobilized  and  planktonic  bacteria  deposited  on  a  clean  silicon 
wafer  were  imaged  using  a  Zeiss  SUPRA  55VP  system  (Carl  Zeiss, 
Germany). 

Results  and  Discussion 

Figure  1  shows  the  method  for  immobilizing  and  preparing 
microscale  patterns  of  live  bacterial  cells.  Depending  on  the 
choice  of  substrate,  a  clean  silicon  wafer  or  a  gold  substrate 
surface  is  first  passivated  using  a  PEG  layer  to  prevent  nonspecific 
Figure 1. Micropatterning of live bacterial cells. The substrate was
first  modified  with  chemicals  that  inhibit  the  nonspecific  adsorption 
of  proteins  (blue  bars)  and  then  etched  using  a  programmable  focused 
Ga+  ion  beam.  The  freshly  etched  surface  was  then  modified  with 
a  cross-linker  (orange  bars)  to  link  the  antibody  (cyan  Y  shapes), 
raised  against  the  bacterial  surface  antigens,  to  the  substrate.  When 
the  patterned  substrate  was  incubated  with  the  bacterial  suspension, 
the  bacterial  cells  adhered  only  to  the  antibody-modified  area  and 
thus  formed  a  monolayer  of  bacterial  cell  patterns. 

adsorption of the antibody.19,22 The substrate is then patterned
using a focused Ga+ ion beam to remove the preselected portions
of  the  passive  layer  from  the  substrate.  This  is  followed  by 
attaching  one  end  of  a  cross-linker  molecule  to  the  sputtered  area 
of  the  substrate  and  the  other  end  to  an  antibody  molecule,  all 
via  covalent  bonds.  For  silicon,  trialkoxysilanes  are  used  to  couple 
the  cross-linkers  and  PEG  moieties  to  the  substrate,  and  for  gold 
substrates  thiol-based  compounds  are  used.  After  rinsing  with 
PBS  buffer  to  remove  the  excess  antibody  from  the  medium,  the 
antibody-patterned  substrate  is  incubated  with  a  bacterial 
suspension  at  ambient  conditions  for  a  period  of  about  3  h.  The 
bacterial  cells  attach  only  to  the  antibody-modified  areas,  so  that 
a micropattern of bacterial cells is achieved.

Optical  images  of  bacterial  patterns  on  gold  and  silicon 
substrates  obtained  while  the  bacteria  were  alive  are  shown  in 
Figure  2.  A  sharp  contrast  between  the  patterned  area  and  the 
PEG-passivated  area  is  clearly  demonstrated  in  these  images. 
Only sparsely attached cells are observed in the PEG-passivated
area, and all of the patterned area is occupied by a dense monolayer
of S. Typhimurium cells. To obtain such high-quality micropatterns
of bacterial cells, attention must be paid to surface
preparation and substrate cleanness during the passivation step
(Figure 3). Flaws in surface passivation can cause failures of
patterning. The failure of passivation facilitates antibody adsorption
everywhere on the surface, leading to indiscriminate
immobilization (Figure S-1 in the Supporting Information).

The  attachment  of  antibody-immobilized  cells  was  robust 
enough  to  resist  washing  with  copious  buffer  solution  for  all 
strains tested. In some experiments, the antibody molecules were
allowed to adsorb noncovalently onto the patterned (unpassivated)
areas without the cross-linker. Bacterial patterns generated in
this way were still resistant to the wash with PBS buffer, indicating
that the attachment, although weaker than that obtained with
covalently coupled antibodies, was robust enough for bacterial
immobilization. This is important in practice because it avoids the extra
step of coupling the antibodies covalently to the substrates. Figure
2A is an example of such immobilization.

The  immobilized  cells  remain  viable  for  at  least  6  h  in  PBS 
buffer  when  stored  at  4  °C,  as  evidenced  by  the  fluorescence  of 
the  immobilized  cells  after  being  stained  with  viability  dyes 
(Figure  5C).  The  viability  of  the  immobilized  cells  in  PBS  buffer 
is  also  supported  by  their  free  motion  around  the  locations  of 
immobilization, as shown in Movie S-1 in the Supporting
Information.  In  growth  medium,  the  immobilized  cells  can  last 
for  at  least  3  days  while  retaining  their  physiological  activities, 
and  a  longer  lifetime  can  reasonably  be  expected  for  the 
immobilized  cells,  provided  nutrients  are  available  and  the 
antibody  molecules  remain  active.  Monitoring  the  immobilized 
bacteria under an optical microscope showed clearly that the cells
were  alive  and  capable  of  division  in  the  growth  medium  at 
ambient  conditions  (Figure  4  and  Movie  S-2  in  the  Supporting 
Information).  It  is  clear  that  the  bacteria  were  dividing  without 
hindrance  even  though  they  were  tied  to  the  substrate  through 
antibody-antigen interactions.

As  seen  in  Figure  2D,  cellular  resolution  of  bacterial  patterning 
was achieved in our experiments. Some of the line thicknesses,
~1 μm, in the patterns are comparable to the dimensions of the
bacteria, and the bacteria are concentrated along these narrow
lines while very few cells are observed outside the lines. In fact,
those few cells immobilized outside the patterned areas are also
most likely immobilized through antibody-antigen interactions.
Passivation  is  not  perfect:  some  antibodies  find  their  way  to  the 
substrate  on  the  defects  of  the  passive  layer.  These  antibodies 
outside  the  patterned  areas  become  centers  of  attachment  for  the 
motile  bacteria  that  happen  to  be  at  these  locations. 

Poly-L-lysine  has  been  successfully  used  to  immobilize  E. 
coli K-12 strains on abiotic substrates2,12 as well as to attach
eukaryotic cells to glass slides.23,24 However, it failed to
immobilize any of the bacterial species we tested, including E. coli,
S. Typhimurium, and Haemophilus influenzae. Other approaches
to immobilizing bacterial cells, such as an amino-terminated
surface,25 a positively charged polymer,26 a gelatin-coated
surface,27 and a direct linking of bacterial cells via covalent
bonds,28 proved to be unreliable, inefficient, and irreproducible
for the strains we tested. The difficulties associated with these
techniques can partially be explained by the fact that bacterial
cells have a very small contact area with the substrate surface
due to their smaller size relative to eukaryotic cells, typically by
1-2 orders of magnitude. Furthermore, a forest of appendages,
such as pili and flagella, protruding out of the surface, as shown
in the inset of Figure 2A, prevents bacteria from contacting the
surface.  Additionally,  many  bacterial  species,  including  S. 
Typhimurium,  have  a  layer  of  extracellular  polymeric  substances 
Figure 2. Micropatterns of live S. Typhimurium cells immobilized on substrates etched by a focused ion beam. (A) Square pattern on gold.
The inset in (A) shows a high-resolution atomic force microscope image of a typical S. Typhimurium Δasd::kanR H71-pHC, obtained in
air, revealing the CFA/I fimbriae (scale bar: 1 μm). (B) Line pattern on silicon. (C) Bobcat mascot and MSU-ICAL logo on silicon. (D)
Enlargement  of  the  area  within  the  white  dotted  box  in  (C),  demonstrating  the  cellular  resolution  of  cell  patterning. 


surrounding  the  bacteria  (capsular  EPS)29  which  prevents  the 
bacteria  from  direct  contact  with  the  surface. 

The  choice  of  antibody  is  critical  to  the  success  of  bacterial 
immobilization.  The  purity  of  the  antibody  is  crucial;  an  affinity- 
purified  antibody  is  preferred,  because  the  serum  contains  a 
considerable  amount  of  proteins  other  than  the  desired  antibody, 
for  example,  serum  albumin,  which  will  compete  for  the  bioactive 
cross-linker  sites  on  the  surface  during  the  covalent  linking  of 
antibody  proteins  to  the  substrate.  This  will  then  mostly  coat  the 
surface  with  serum  albumin,  reducing  the  fraction  of  the  surface 
covered  with  antibody  molecules.  Therefore,  the  use  of  a  serum 
without  affinity  purification  will  yield  a  low  density  of  antibody 
molecules  on  the  substrate  and  thus  give  rise  to  a  low  density 
of  immobilized  bacteria.  This  assumption  is  supported  by  our 
early  attempts  to  immobilize  bacterial  cells  through  unpurified 
anti-CFA/I serum, which only afforded unreliable, poor immobilization.
The efficiency of bacterial immobilization also
depends  on  the  choice  of  antibody-antigen  pair.  Bacterial  species 
express  a  large  number  of  surface  antigens  which  play  various 
roles  in  bacterial  virulence  and  adhesion.30  Antibodies  against 
many  of  these  surface  antigens  are  commercially  available,  but 
we  recommend  that  those  appendages  that  protrude  outside  the 
cell  walls  and  do  not  have  rapid  movements  independent  of  the 
bacteria  be  considered.  For  example,  anti-flagellin  is  not  a  good 


choice for leashing bacteria to substrates because flagella, as the
bacterial motility organ, rotate with very high speeds, exceeding
10 000 rpm at 35 °C,31 vastly reducing the chance of antibody-antigen
interactions. The interaction between an antibody-antigen
pair is fairly weak: a force of ~50 pN is required to break the
antibody-antigen interaction,32 which is more than 1 order of
magnitude smaller than that needed to break a covalent bond (>1 nN). At this time,
we do not know how much force a flagellum imparts to an S.
Typhimurium cell, but, to immobilize a bacterium, we hypothesize that
at least one antibody-antigen interaction is required to hold the
bacterium in place. For the reasons discussed above, in our
experiments, the CFA/I fimbriae were chosen as the target antigen
because (1) they are long and protrude outside the cell body
(inset of Figure 2A), (2) the fimbriae are expressed in S.
Typhimurium strains in abundance, and (3) these appendages do
not rotate rapidly independently of the bacteria as do the flagella.
A typical S. Typhimurium cell will fit into a 2 × 1 μm2 area
(inset Figure 2A), which predicts a maximum packing density
of about one bacterium per 2 μm2. This value is in good agreement
with our observations conducted under an optical microscope (Figure
2A,C and the Supporting Information).
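
The packing estimate and the force comparison in this paragraph reduce to simple arithmetic, sketched below. The sketch assumes each cell occupies a rectangular 2 × 1 μm footprint with no gaps and uses the quoted ~50 pN and 1 nN figures as nominal values; the 50 μm square is a hypothetical pattern size chosen for illustration, and the function name is not from the paper.

```python
def max_packing_density(cell_length_um, cell_width_um):
    """Upper-bound monolayer density (cells per um^2) for cells lying flat,
    assuming rectangular footprints tiled without gaps."""
    return 1.0 / (cell_length_um * cell_width_um)

# Footprint from the text: 2 um x 1 um per cell.
density = max_packing_density(2.0, 1.0)

# ASSUMPTION: a 50 um x 50 um patterned square, for illustration only.
pattern_area_um2 = 50.0 * 50.0
max_cells = density * pattern_area_um2

# Nominal rupture forces quoted in the text.
antibody_antigen_force_n = 50e-12   # ~50 pN
covalent_bond_force_n = 1e-9        # lower bound, >1 nN
force_ratio = covalent_bond_force_n / antibody_antigen_force_n

print(f"density: {density} cells/um^2")
print(f"max cells on example square: {max_cells:.0f}")
print(f"covalent / antibody-antigen force ratio: about {force_ratio:.0f}")
```

A density of 0.5 cells per μm2 is the same statement as one bacterium per 2 μm2, and the force ratio of about 20 is consistent with "more than 1 order of magnitude".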

The  unique  features  of  the  bacterial  cells  patterned  in  this  way 
are  the  stability  and  self-sustaining  capability  of  the  patterns.  As 
shown  in  Figure  4  and  Movie  S-2  in  the  Supporting  Information, 
Figure 3. Characterization of silicon and gold substrates. (A) XPS spectra of silicon chips (i) modified with APTES; (ii) modified with
APTES and BMPS; and (iii) modified with antibody. No N1s signal was observed for unmodified silicon, and both C1s and N1s signals
increased as APTES, BMPS, and antibody were linked to the silicon surface (from i to iii), implying that the antibody was successfully bonded
to the silicon surface with APTES/BMPS as the cross-linkers. (B) XPS spectra of silicon chips (i) cleaned by O3 plasma; (ii) passivated using
PEG-silane; and (iii) passivated and incubated with anti-CFA/I antibody. The increase of the C1s peak from (i) to (ii) suggests the success
of the passivation of the silicon surface using PEG-silane, while the absence of the N1s peak in (iii) confirms the resistance against antibody
adsorption of the passivated silicon surface. (C) XPS spectra of gold substrates (i) modified with 11-MUDA and (ii) with anti-CFA/I antibody
cross-linked. For the gold surface, C1s, N1s, and O1s signals were used to monitor the substrate modification. The prominent N1s signal
in (ii) and the increase of O1s from (i) to (ii) indicate the attachment of the antibody. (D) XPS spectra of gold substrates (i) cleaned by
O3 plasma; (ii) passivated with SPT-11; and (iii) passivated and incubated with anti-CFA/I antibody. The increases in the C1s and O1s
peaks from (i) to (ii) suggest the success of the passivation of the gold surface using SPT-11, while the absence of the N1s peak in (iii)
confirms  the  resistance  against  antibody  adsorption  of  the  passivated  gold  surface.  (E)  AFM  image  of  antibody  molecules  covalently  linked 
to  the  silicon  surface.  Antibody  molecules  linked  to  the  gold  surface  were  not  imaged  by  AFM  because  of  the  interference  from  the  gold 
nanograins. 
Figure 4. (A) Selected time lapse images over an incubation period
of 270  min,  showing  the  regeneration  of  immobilized  S.  Typhimurium 
on  a  patterned  silicon  substrate  that  was  incubated  in  growth  medium 
at  room  temperature.  The  red  ovals  highlight  the  dividing  cells.  (B) 
Time  lapse  image  at  270  min  showing  that  newly  divided  cells  only 
occupy  the  available  patterned  areas  and  that  excess  cells  are  released 
into  the  growth  medium  and  can  be  removed  by  gently  rinsing. 

the  immobilized  cells  are  capable  of  dividing  and  regenerating. 
The  daughter  cells  either  stay  attached  to  the  antibody-modified 
area  or  enter  the  growth  medium  after  the  antibody-modified 
area  is  fully  occupied.  Therefore,  the  patterns  are  always  covered 
by  a  monolayer  of  live  cells  and  do  not  change  physical  dimensions 
as  a  result  of  cell  division.  In  a  flow  reactor,  cells  released  into 
the  liquid  phase  will  be  carried  away  and  the  self-refreshing 
bacterial  pattern  can  last  as  long  as  the  antibody  remains  active, 
provided  that  the  medium  has  the  nutrients  to  sustain  life.  This 
is  particularly  useful  for  those  applications,  such  as  water  quality 
control,  that  require  continuous  monitoring  of  pollutants  as  an 
environmental  sensor.  These  features  are  not  available  for  patterns 
prepared by mechanically transferred cells on an agar plate;4,5
for sensors fabricated by incorporating cells in alginate,33 agar,34
or  sol-gel  matrices;35  or  for  arrays  constructed  by  physical 
entrapment  into  microwells.8  For  these  patterns,  bacteria  are  held 
on  the  substrates  or  inside  the  microwells  with  weak  forces,  and 
the  cells  do  not  stay  fixed  to  the  patterned  areas  for  long  periods 
of  time.  These  patterns  will  eventually  degrade  and  disintegrate 
when  exposed  to  a  liquid  medium  or  a  flow  reactor. 
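
The self-renewing behavior described above can be illustrated with a toy occupancy model. This is a deliberately simplified sketch, not a model fitted to the data in Figure 4: it assumes that every attached cell divides once per generation, that a daughter cell attaches whenever a free antibody site exists, and that all other daughters are released into the medium. The function name and the numbers in the example are hypothetical.

```python
def simulate_pattern(n_sites, initial_cells, generations):
    """Track monolayer occupancy of a pattern with a fixed number of sites.

    Returns the occupancy after each generation and the cumulative number
    of daughter cells released into the medium.
    """
    occupied = initial_cells
    released = 0
    history = [occupied]
    for _ in range(generations):
        daughters = occupied                  # one division per attached cell
        free_sites = n_sites - occupied
        attached = min(daughters, free_sites) # daughters fill free sites first
        occupied += attached
        released += daughters - attached      # the rest leave the surface
        history.append(occupied)
    return history, released

# Hypothetical example: 100 antibody sites, 10 cells attached at the start.
history, released = simulate_pattern(n_sites=100, initial_cells=10, generations=6)
print(history)
print(released)
```

In this sketch the occupancy rises until the pattern is full and then stays constant while the released count keeps growing, which mirrors the qualitative claim that the pattern keeps its dimensions and excess cells are carried away.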

The  observation  of  patterned  bacteria  on  antibody-modified 
substrates  allows  probing  of  the  individual  or  collective  behavior 
of  the  bacteria.  For  example,  we  can  follow  the  increase  in  surface 
population  density  of  bacteria  incubated  in  a  growth  medium. 
Such  observation  in  our  experiments  yielded  a  surprising  result, 
as  shown  in  Figure  5A,B,  in  which  bacteria  immobilized  in  a 
lying-down  orientation  took  a  standing-up  orientation  as  their 
density increased. This standing-up orientation of the crowded
cells was also confirmed by laser scanning confocal microscopy
(LSCM) images (Figure 5C,D). Although this behavior is not
well  understood  at  this  time,  it  might  be  related  to  the  depletion 
of  nutrients  at  the  crowded  bacterial  positions  and  to  the  struggle 
of  the  bacteria  to  move  away  from  their  immobilized  positions. 
This  hypothesis  is  supported  by  the  observation  of  an  excess 
number  of  flagella  produced  by  the  immobilized  bacteria 
(Supporting  Information  Figure  S-2),  presumably  in  an  effort  to 
free  themselves  from  their  positions. 

The use of purified antibodies targets specific antibody-antigen
interactions;  hence,  only  bacteria  with  the  targeted  antigens  are 
immobilized.  This  specificity  of  immobilization  can  be  used  in 
many  applications,  including  sorting  organisms  from  a  mixed 
culture.  A  successful  sorting  of  S.  Typhimurium  (green)  from 
a  culture  mixed  with  E.  coli  (red)  is  demonstrated  in  Figure  6. 
The  mixture  (Figure  6A)  was  allowed  to  interact  at  ambient 
conditions  for  only  2  h  with  a  substrate  patterned  in  a  checkerboard 
geometry  with  antibodies  specific  to  S.  Typhimurium.  Figure  6B 
shows only the desired organism (S. Typhimurium) immobilized
on  the  antibody  patterns:  E.  coli  cells  were  washed  off  the 
substrate.  Sorting  a  specific  bacterial  strain  typically  takes  weeks 
to  months,  because  the  mixed  culture  is  subjected  to  the  repeated 
inoculation  and  growth  of  sequentially  diluted  cultures  on 
preselected  agar  plates.36 

Could  bacteria  be  linked  directly  to  a  substrate  without  a  need 
for  an  antibody?  The  advantage  of  this  method,  if  successful, 
would  be  to  avoid  the  necessity  of  finding  an  antibody  for  a 
given  bacterial  strain.  The  disadvantage  would  be  that  all  the 
proteins,  and  possibly  other  biomolecules,  in  the  medium  would 
have  a  chance  at  immobilization;  hence,  the  bacterial  specificity 
would  be  lost.  We  tested  this  idea  by  linking  bacterial  cells  directly 
to  a  substrate  through  covalent  coupling  between  bacterial 
membrane  proteins  and  cross-linkers  on  the  substrate,  as  was 
done  with  the  antibodies.  The  activated  substrates,  with  either 
carboxyl  groups  or  maleimido  groups,  were  directly  incubated 
with  a  suspension  of  bacterial  cells  in  the  growth  medium  and 
also  in  the  PBS  buffer  solution  for  ~15  h  under  ambient 
conditions.  The  results  (not  shown)  indicate  only  sparsely  attached 
cells  populating  the  surface.  We  hypothesize  the  reason  for  the 
failure  of  direct  immobilization  is  that  the  activated  surface  was 
blocked  by  proteins  in  the  bacterial  suspension  before  the  bacterial 
cells  could  reach  the  surface.  These  proteins  are  components  of 
the growth medium or are secreted by the bacteria in the
suspension. It is difficult to remove these proteins completely,
even by washing the cells with PBS buffer repeatedly. Furthermore,
the capsular EPS layer surrounding each bacterium would
prevent the bacterial surface proteins from making direct contact
with the activated substrate surface.

Figure 5. Immobilized S. Typhimurium cells change their orientations
according to the surface population density on the substrate.
Images (A) and (B) correspond to a sample incubated in growth
medium at 37 °C for 3 and 15 h, respectively. Notice that while the
cells are lying down in part (A), they appear to stand up in (B), when
the surface is crowded (see Figure 5D). LSCM image of a sample
incubated in growth medium for 15 h and then stored in PBS buffer
at 4 °C for 6 h and stained with viability stains (C) indicating that
the majority of the cells were alive. The reconstituted Z-section
image of the immobilized cells (D) further confirms that a majority
of the cells took a standing-up orientation.

Figure 6. Sorting S. Typhimurium cells from a mixture of S.
Typhimurium and E. coli. (A) Epifluorescence image of a mixed
culture of S. Typhimurium expressing GFP and E. coli expressing
RFP. (B) Epifluorescence image of the sorted cells on silicon using
a checkerboard microarray pattern of an antibody highly specific to
the CFA/I fimbriae of S. Typhimurium.

Summary

Our results demonstrate that cellular resolution can be achieved
for the preparation of self-renewing bacterial patterns on abiotic
substrates through antibody-antigen binding with high efficiency
and controllability. The efficiency of the immobilization is
attributed to the careful selection and assembly of the substrate
surface chemistry at each step and to the affinity-purified antibody
raised specifically against a carefully targeted antigen protruding
outside the cell surface of the bacteria, the CFA/I fimbriae of S.
Typhimurium. Patterned cells remain viable under this immobilized
condition and are capable of reproducing. The technique
is readily applicable to sorting specific bacteria from a mixed
culture within hours as compared to the standard bacteria
purification methods, which typically take days.

Acknowledgment. This work is funded by NASA-EPSCOR
under Grant NCC5-579, by MSU CBIN, by U.S. Public Service
Grant AI-41123, and by Montana Agricultural Station and USDA
Formula funds. We thank Mr. M. Deliorman for his help on
ToF-SIMS and Dr. B. Pitts for her help on optical microscope
imaging.


Supporting  Information  Available:  Images  of  immobilized  S. 
Typhimurium  cells  and  the  flagella  expressed  by  the  immobilized  cells 
(Figures S-1 and S-2), and movie clips (Movies S-1 and S-2) showing
the  motion  and  dividing  of  immobilized  cells.  This  material  is  available 
free  of  charge  via  the  Internet  at  http://pubs.acs.org. 
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Functional  analysis  of  hepatitis  B  virus  (HBV)  core  particles  has  associated  a 
number of biological roles with the C terminus of the capsid protein. One set of
functions requires the C terminus to be on the exterior of the capsid, while
others  place  this  domain  on  the  interior.  According  to  the  crystal  structure  of 
the  capsid,  this  segment  is  strictly  internal  to  the  capsid  shell  and  buried  at  a 
protein-protein  interface.  Using  kinetic  hydrolysis,  a  form  of  protease 
digestion  assayed  by  SDS-PAGE  and  mass  spectrometry,  the  structurally 
and  biologically  important  C-terminal  region  of  HBV  capsid  protein  assembly 
domain (Cp149, residues 1-149) has been shown to be dynamic in both dimer
and capsid forms. HBV is an enveloped virus with a T = 4 icosahedral core that
is  composed  of  120  copies  of  a  homodimer  capsid  protein.  Free  dimer  and 
assembled  capsid  forms  of  the  protein  are  readily  hydrolyzed  by  trypsin  and 
thermolysin,  around  residues  127-128,  indicating  that  this  region  is  dynamic 
and  exposed  to  the  capsid  surface.  The  measured  conformational  equilibria 
have  an  opposite  temperature  dependence  between  free  dimer  and 
assembled  capsid.  This  work  helps  to  explain  the  previously  described 
allosteric  regulation  of  assembly  and  functional  properties  of  a  buried 
domain. These observations make a critical connection between structure,
dynamics, and function, made possible by the first quantitative measurements
of conformational equilibria and rates of conversion between protein
conformers for a megaDalton complex.

©  2007  Elsevier  Ltd.  All  rights  reserved. 
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Introduction 

In  solution,  many  proteins  exist  as  an  ensemble  of 
conformations.  The  population  distribution  of  the 
ensemble  is  dependent  on  the  barriers  between 
energy  minima  on  the  multi-dimensional  energy 
landscape  of  the  protein.  In  recent  years,  the  strong 
connection  between  dynamics  and  function  has  made 
the  study  of  protein  dynamics  a  critical  research 
frontier.1-3 The requirement of local conformational
plasticity  in  biological  function  is  well  documented 
for large complexes such as the ribosome,4 nuclear
pores,5 and virus particles.6,7 A range of biophysical
approaches can be used to describe the energy landscape
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of a protein (i.e. the distributions of different
conformations  and  the  rates  of  exchange),  and  this  has 
been done for a number of monomeric proteins.8-10
However, due to technical challenges, detailed mapping
is not available for the dynamics of viruses or
any other megaDalton complexes. Enzymatic hydrolysis
has a long history as an effective tool for
distinguishing  between  folded  and  unfolded  proteins 
and  in  the  identification  of  dynamic  regions.  Here  we 
demonstrate  that  when  applied  rigorously,  kinetic 
hydrolysis allows the determination of conformational
equilibria and rate constants for megaDalton
nucleoprotein  structures. 

More  than  350  million  people  suffer  from  chronic 
infection  with  hepatitis  B  virus  (HBV).  It  is  one  of  the 
major contributing factors to hepatocellular carcinoma
and cirrhosis. HBV is an enveloped virus with
an icosahedral core. The majority of cores are
~350 Å in diameter11 and are composed of 120
core protein homodimers arranged with T = 4 icosahedral
symmetry12 (Figure 1). The core protein
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(a) 


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Figure  1.  Hepatitis  B  capsid,  (a) 
The HBV capsid has T = 4 symmetry and is assembled from 120 copies of
Cp149 dimer. A hexamer of dimers surrounds the quasi-6-fold (icosahedral
2-fold) axis. The C terminus of the protein is located at the 5-fold
and 6-fold symmetry axes, and in all structural models to date the C
terminus of the protein projects inward. A model of trypsin is
included for scale (blue, upper left), and the observed cleavage
site is indicated with exaggerated scale (green spheres). A scale bar of
7.0 nm is shown in the lower-right for (a) and (b). (b) The asymmetric
unit from the capsid crystal structure. Each dimer contains two copies
of Cp149, which meet together with a 30 Å tall helical interface. Contact
between dimers occurs in a limited region spatially centered around the
end of helix α5. The cleavage site at residues R127/T128 is indicated
by small green spheres, and residue L140 on the C-terminal coil is shown
where it meets F110 and Y118 as part of the hydrophobic interactions
stabilizing the folded conformation observed in the crystal structure. (c)
The amino acid sequence for Cp149. The cleavage site is shown in green,
and a putative ubiquitin ligase binding site is indicated in blue.


(HBcAg) has 183 amino acids and can be divided
into an assembly domain, residues 1-149 (Figure
1(c)), and a C-terminal 34 amino acid RNA-binding
domain. Only the assembly domain of core protein
is required to form the capsid. HBV also produces a
variant of the protein (HBeAg) that includes an
additional ten amino acids at the N terminus but
terminates at position 149. The 149 residue construct
from residues 1 to 149 (Cp149), lacking the C-terminal
domain, forms well-ordered structures13,14
that are indistinguishable from the cores of native
virions.15 The assembly domain dimer (Cp149₂) is
the basic unit of capsid assembly. It forms spontaneously
at translation16 and requires a denaturing
concentration of chaotrope to dissociate.17 The
structure of the dimer is dominated by the interface
between monomers, a four-helix bundle which
projects 30 Å from the plane of the capsid surface
(Figure 1(a)), and is composed of a helix-turn-helix
from each monomer (Figure 1(b)).13,14,18,19 It
should be noted that all structural models of
Cp149₂ are based on intact particles. Cp149₂ can
be expressed as a recombinant protein in Escherichia
coli, where it spontaneously forms capsids;
these capsids can be dissociated into dimers and
reassembled.20 The assembly process of recombinant
particles has been well-characterized from
both kinetic and thermodynamic standpoints,
making HBV one of the best understood model
systems  for  studying  virus  capsid  assembly.1  21  24 
Capsid  assembly  is  a  function  of  pH,  temperature, 
and  ionic  strength.2  24  Assembly  models  predict 
that  the  dimer  undergoes  a  transition  between 
inert  and  assembly-competent  conformations.2'  A 
number  of  small  molecules  act  as  effectors  that 
enhance  or  inhibit  the  assembly  process.21  23  31  The 
best understood of these, HAP1, enhances assembly
and particle stability, at least in part, by
stabilizing  the  conformation  of  the  C  terminus  of 
the  assembly  domain.14 

Capsid dynamics are likely critical to HBV function.
For example, the C-terminal domain has interactions
outside the capsid, playing a major role
in intracellular trafficking,32 even though in structural
models it is within the capsid interior.13,18,19
Phosphorylation of the C-terminal domain is
required for nuclear localization,33-36 while loss of
phosphorylation correlates with particle secretion.37
It has also been suggested that a PPAY sequence
near the C terminus of the assembly domain is
critical for trafficking to the endoplasmic reticulum
(ER).36 These results all suggest that, at least
transiently,  the  C-terminal  domain  of  core  protein 
is  exposed  on  the  exterior  of  the  HBV  particle. 
However,  the  C-terminal  domain  is  also  associated 
with  interactions  that  take  place  on  the  capsid 
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interior.  Phosphorylation  of  the  C-terminal  domain 
is  required  for  specific  packaging  of  viral  RNA,39-41 
and the pattern of phosphorylation directly affects
reverse transcription, which takes place within the
assembled capsid.34,41,42 Together, these data create
a  paradox  where  the  C-terminal  domain  is  localized 
to  the  capsid  interior  but  is  functionally  active  on  the 
exterior.  This  can  be  resolved  if  the  C  terminus  is 
dynamic;  however,  quantitative  evidence  has  thus 
far  been  lacking. 

Kinetic hydrolysis is well suited to the study of
stability and dynamics in the context of large complexes
such as the HBV particle. In this technique, a
protein sample is hydrolyzed using a proteolytic
enzyme under carefully controlled conditions.
Based upon the well-established theory that proteolysis
only occurs in exposed regions lacking well-defined
structure,43-46 this technique has long been
used to isolate individual domains for structural
analysis,47 identify dynamic protein regions,48,49
and more recently as a tool to quantitatively probe
the high energy states of proteins.50

The  fundamental  kinetic  model  involves  dynamic 
motion  between  a  protected  "native"  state  and  a 
partially  unfolded,  "open"  conformation,  which  is 
cleaved  by  protease  at  a  known  rate: 

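\[
\text{Closed} \;
\underset{k_{\text{close}}}{\overset{k_{\text{open}}}{\rightleftharpoons}} \;
\text{Open} \; \xrightarrow{\,k_{\text{cleave}}\,} \; \text{Cleaved}
\qquad (1)
\]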

By  varying  enzyme  concentration  and  quantifying 
its  effect  on  the  overall  reaction  rate,  the  equilibrium 
and  rate  of  transition  between  closed  and  open 
forms  of  the  protein  can  be  assayed.  By  definition, 
the  protein  in  the  open  conformation  behaves  as  an 
intrinsically  disordered  polypeptide  and  is  cleaved 
by the protease following standard Michaelis-Menten
kinetics.50 This gives rise to a simple kinetic
model where disappearance of intact protein
depends on the concentration of protein in the
open conformation, catalytic constants (kcat/KM)
and enzyme concentration (equation (8)). Although
systematically similar to hydrogen-deuterium exchange
(HDX),51,52 the scale of motion required
for detection by proteolytic hydrolysis of folded
proteins is much greater. The exact deformation
from the protected "native" state will vary depending
on local constraints, but a typical motion will
involve a substantial displacement and unfolding of
any structure within two to four amino acids on
each side of the cleavage location.44 This large-scale
motion is expected to be rare compared to the
smaller transitions required for amide proton
exchange in HDX. Protein dynamics on the nanometer
scale are known to be involved in complex
formation,53 viral capsid breathing49,54-56 and viral
docking and entry events.57
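
As a rough illustration of how the scheme in equation (1) gives rise to two limiting regimes, the sketch below evaluates a steady-state expression for the observed rate constant, kexp = kopen kcleave / (kopen + kclose + kcleave), with kcleave = (kcat/KM)[E]. This closed form is an assumption made for the example; the full kinetic equation used in this work is defined in Materials and Methods and is not reproduced here. The numerical inputs are the dimer values at 37 °C from Table 1, and the function and variable names are hypothetical.

```python
def k_exp(enzyme_molar, k_open, k_close, kcat_over_km):
    """Observed first-order rate constant (min^-1) for Closed <=> Open -> Cleaved.

    Assumes a steady-state open population and pseudo-first-order cleavage,
    k_cleave = (kcat/KM) * [E]. This form is an illustrative assumption.
    """
    k_cleave = kcat_over_km * enzyme_molar
    return k_open * k_cleave / (k_open + k_close + k_cleave)


# Illustrative inputs: dimer at 37 C (Table 1).
K_OPEN = 1.19e-2          # min^-1
K_OP = 2.59e-3            # equilibrium constant, k_open / k_close
K_CLOSE = K_OPEN / K_OP   # min^-1, derived from the two values above
KCAT_KM = 12.5e6          # min^-1 M^-1

# Enzyme concentrations chosen only to span both regimes (hypothetical).
for enzyme in (1e-9, 1e-7, 1e-5, 1e-3):
    rate = k_exp(enzyme, K_OPEN, K_CLOSE, KCAT_KM)
    print(f"[E] = {enzyme:.0e} M   k_exp = {rate:.3e} min^-1")
```

At low enzyme concentration the expression reduces to approximately Kop (kcat/KM)[E] (cleavage-limited), while at high enzyme concentration it saturates at kopen (opening-limited).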

Here we present the first quantitative measurements
of conformational equilibria and rates of
conversion between conformers for a megaDalton
complex, the HBV capsid. Our results indicate that
the C terminus of Cp149 is dynamic and transiently
exposed on the surface of the capsid, in contrast to
its position in crystal and cryo-electron microscopy
structures. The exposure of this region is highly
dependent on the assembly state of the protein as
well as the temperature. The enthalpy and entropy
of the opening transition for the C terminus reverse
sign as a function of assembly, identifying this
region as a thermodynamic molecular switch. These
results suggest that regulation of assembly may involve
the dynamic motion of the C-terminal region
of Cp149₂.


Results 

Identification  of  dynamic  domains 

HBV Cp149 has two assembly states: dimer and
capsid. Once assembled into a capsid, there is a substantial
hysteresis to disassembly, making it possible
to  conduct  separate  experiments  on  dimer  and 
capsid  forms  under  identical  solution  conditions.1 
For  proteolytic  reactions,  capsid  and  dimer  were 
diluted  into  reaction  buffer  conditions  immediately 
prior  to  the  initiation  of  the  reaction,  and  the 
reactions  never  exceeded  4  h  in  length.  Using  nearly 
identical dilution-based procedures to study disassembly
reactions, capsids at similar concentrations
remain  stable  for  at  least  five  days.1  Size-exclusion 
chromatography  and  dynamic  light  scattering  were 
used  to  confirm  that  capsids  remained  intact 
throughout  the  initial  phase  of  digestion,  which 
was  defined  as  the  time  period  with  at  least  70% 
remaining  uncleaved  protein.  Capsids  were  found 
to  be  fully  stable  at  this  level  of  cleavage,  with 
no  detectable  dissociation  (Figure  2(b)).  All  of  the 
kinetic hydrolysis experiments conducted on assembled
capsid were carefully designed to only
include  data  collected  before  the  appearance  of 
degraded  particles. 

To  probe  for  dynamic  regions  in  the  core  protein 
and  to  investigate  the  effect  of  particle  assembly  on 
stability  and  dynamics,  we  first  tested  different 
proteases  and  analyzed  samples  from  time-course 
reactions by SDS-PAGE. Both trypsin and thermolysin
degrade the intact protein in a measurable
progression  (Figure  3(a)  and  (b),  black  triangles).  In 
the  initial  reaction  phase,  both  proteases  produced  a 
single  primary  product  band  approximately  3  kDa 
smaller  than  the  intact  protein  (Figure  3(a)  and  (b) 
gray triangles). A second product band arose after
further digestion that was another ~2 kDa smaller
as  judged  by  SDS-PAGE:  this  second  band  has  weak 
intensity  (never  greater  than  10%  total  composition) 
and  has  greatly  delayed  appearance,  especially  for 
assembled  capsid.  These  results  indicated  that  there 
was  a  dynamic  or  unstructured  region  in  the  protein 
and  that  the  initial  reaction  product  was  relatively 
stable.  In  addition,  the  same  site  was  accessible  to 
protease  in  dimer  and  capsid.  Other  cleavage 
products  were  subsequent  to  the  primary  cleavage 
and  evident  only  after  more  aggressive  proteolysis. 
These late-stage digestion products showed differing
patterns between dimer and capsid (data not
shown).

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Figure 2. HBV capsid protein maintains integrity
during proteolysis. (a) A test mixture of capsid and
dimer stock separated with size exclusion chromatography,
using the same buffer conditions as the proteolysis
reactions. (b) The purified capsid stock was digested with
the standard protocol to the level of 70% remaining
uncleaved protein, then run over the size exclusion
column. No dimer signal is seen, and the capsid intensity
is not diminished significantly. The only background
subtraction used for (a) and (b) is a constant linear
correction.

Gel  densitometry  analysis  of  proteolytic  reactions 
revealed  a  consistent  first-order  exponential  decay 
trend  for  the  parent  protein  (Figure  3(c),  circles). 
Proteolytic cleavage rate constants (kexp) were
measured from 19 °C to 37 °C for both HBV capsid
and dimer. The observed rate of digestion for dimer
is up to 20-fold faster than for capsid, necessitating
adjustments to sampling times in order to accurately
characterize the first 30% of digestion for both
species (Figure 3(d), circles). First-order behavior
was seen at all temperatures and the change in
reaction rate with temperature followed the well-established
temperature sensitivity for trypsin.58,59
In the presence of 5.0 µg/ml of trypsin at 25 °C, the
observed rate constant for hydrolysis of dimer
is approximately 0.013 min⁻¹, with a half-life of
~53 min.
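
The quoted half-life follows directly from the first-order rate constant, t½ = ln 2 / kexp. The short sketch below reproduces that arithmetic and also estimates how long the reaction takes to reach the 70% intact threshold used to define the initial phase of digestion. The function names are hypothetical; the rate constant is the value quoted above.

```python
import math


def half_life(k):
    """Half-life of a first-order decay, in the time unit of 1/k."""
    return math.log(2) / k


def time_to_fraction_intact(k, fraction):
    """Time at which the given fraction of intact protein remains."""
    return -math.log(fraction) / k


K_EXP_DIMER = 0.013  # min^-1, dimer with 5.0 ug/ml trypsin at 25 C (from the text)

print(f"half-life: {half_life(K_EXP_DIMER):.1f} min")
print(f"time to 70% intact: {time_to_fraction_intact(K_EXP_DIMER, 0.70):.1f} min")
```

Under these conditions the half-life is about 53 min, and the 30% cleavage limit is reached after roughly 27 min.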

The  precise  location  of  proteolytic  cleavage  was 
identified using mass spectrometry. Matrix-assisted
laser desorption ionization (MALDI) and electrospray
ionization (ESI) were employed to analyze
both quenched and online reactions. The only
significant peptide product mapped to a C-terminal
fragment of Cp149 composed of residues 128-149
(measured: 2333.263(±0.012) Da, predicted:
2333.263 Da). Gels showing the intact protein and
primary cleavage product confirm the presence of a
species matching the size of amino acid residues
1-127 of Cp149. For quantitative reactions, no
evidence for alternative cleavage sites was found
by mass spectrometry. The lack of trypsin cleavage
sites within the 128-149 peptide rules out enzymatic
competition from the released product. Mass spectrometry
and gel-based results are in agreement that,
during the initial stages of digestion, proteolysis
proceeds exclusively via the R127/T128 cleavage
(Figure 1(c)).

Kinetic  model 

Based upon the observed first-order kinetics of
proteolytic cleavage and the single cleavage product,
we adopted the two-state model described in
equation (1) as a working hypothesis for core
protein cleavage (for detailed definitions of equations
and constants see Materials and Methods).
Because trypsin requires a region of unfolded amino
acids bordering the cleavage location,44 a protein
which contains secondary structure or steric protection
of a cleavage site will be resistant to proteolysis;
this is defined as the "closed" state. Random or
concerted motion has the potential to transiently
remove this protection in the normal course of a
protein's dynamic behavior in solution. Upon local
or global unfolding (de-protection) the cleavage site
adopts the "open" state and is exposed to hydrolytic
attack. This two-state model has had wide acceptance
for a variety of protein systems,50,60-62 and the
observed kinetic trends for Cp149 validate the two-state
model for this application (for details, see
Discussion).

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Figure 3. Hydrolysis of HBV capsid protein monitored with SDS-PAGE. (a) The progression of proteolysis in the
presence of high concentrations of enzyme (37 °C, dimer). Trypsin (open arrow), intact Cp149 (black arrow), and the
cleavage product (gray arrow) are indicated. Alternating time samples (5 min spacing) are labeled, and first and last lanes
are run in duplicate. (b) Experiments for determination of observed rate constants (kexp) are restricted to less than 30%
cleavage (25 °C, capsid). Multiple sampling replicates for each time point are run to estimate quantitation error. (c)
Quantitation of the bands shown in (a). The disappearance of the intact protein (circles) and appearance of product
(triangles) follow first-order kinetics. (d) Gel band intensities from (b) were fit to an exponential decay to determine the
rate.

Because trypsin is highly specific in not only the P1
and P1' sites but also P2-P3', the pentameric peptide
Abz-IRTPP-Tyr(3-NO2) was used as a model substrate
to determine the kcleave constant, the intrinsic
cleavage constant for an unstructured peptide at a
given enzyme concentration, under the exact conditions
used with Cp149. This constant contains a
kcat/KM component that describes the catalytic efficiency
on a specific peptide backbone in the open
state (equation (6)). By keeping substrate concentrations
sufficiently low, kcleave is linearly dependent on
enzyme concentration. Fluorometric assays of trypsin
activity with the model peptide were accurate
at nanomolar concentration, giving kcat/KM values
for all appropriate temperatures (Table 1). The
measured KM value of 20 µM is well above the substrate
concentration in the kinetic proteolytic assays
(0.3 nM-300 nM), which validates the first-order
simplification of Michaelis-Menten kinetics and
justifies the treatment of kcleave in the kinetic model
as a constant that scales linearly with enzyme.
Kinetic characterization of trypsin using the synthetic
peptide revealed no significant decrease in
kcat/KM over an 8 h span, a test for any potential
complication from enzyme self-proteolysis or other
degradation during the course of Cp149 cleavage
experiments.
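
To give a sense of how good the first-order simplification is over the stated substrate range, the sketch below compares the full Michaelis-Menten rate with its low-substrate limit, (kcat/KM)[E][S]. The KM (20 µM) and the 25 °C kcat/KM value are taken from the text and Table 1; kcat is derived from their product, and the enzyme concentration is an arbitrary placeholder that cancels in the ratio. All names are hypothetical.

```python
def michaelis_menten_rate(substrate, enzyme, kcat, km):
    """Full Michaelis-Menten rate, v = kcat [E][S] / (KM + [S])."""
    return kcat * enzyme * substrate / (km + substrate)


def first_order_rate(substrate, enzyme, kcat, km):
    """Low-substrate limit, v = (kcat/KM) [E][S]."""
    return (kcat / km) * enzyme * substrate


def overestimate_ratio(substrate, enzyme, kcat, km):
    """Ratio of the first-order approximation to the full rate (1 + [S]/KM)."""
    return (first_order_rate(substrate, enzyme, kcat, km)
            / michaelis_menten_rate(substrate, enzyme, kcat, km))


KM = 20e-6                 # M, measured for the model peptide
KCAT_OVER_KM = 8.92e6      # min^-1 M^-1 at 25 C (Table 1)
KCAT = KCAT_OVER_KM * KM   # min^-1, derived from the two values above
ENZYME = 1e-9              # M, arbitrary placeholder (cancels in the ratio)

for substrate in (0.3e-9, 300e-9):
    ratio = overestimate_ratio(substrate, ENZYME, KCAT, KM)
    print(f"[S] = {substrate:.1e} M   first-order / full = {ratio:.5f}")
```

Even at the top of the range (300 nM) the linear form overestimates the rate by only about 1.5%, which is consistent with treating kcleave as proportional to enzyme concentration.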

For both dimer and capsid forms of Cp149, the
maximum kexp rate constant observed with high
concentrations  of  protease  (Figure  4(a))  correlated 
with  opening-limited  conditions,  where  the  protein 
is  cleaved  as  soon  as  it  reaches  an  open  state.  Under 
opening-limited  conditions,  the  cleavage  rate  is 
equal  to  the  unfolding  rate  at  a  given  cleavage  site 


Table 1. Kinetic and thermodynamic parameters for the IRTPP cleavage site

Temperature (°C)                 19                   25                   31                   37
Peptide kcat/KM (min⁻¹ M⁻¹)      7.66 ± 0.23 × 10⁶    8.92 ± 0.04 × 10⁶    10.6 ± 0.26 × 10⁶    12.5 ± 0.46 × 10⁶
Dimer kopen (min⁻¹)              3.53 ± 0.38 × 10⁻³   5.08 ± 0.51 × 10⁻³   7.60 ± 0.70 × 10⁻³   1.19 ± 0.13 × 10⁻²
Capsid kopen (min⁻¹)             2.44 ± 0.24 × 10⁻³   4.57 ± 0.78 × 10⁻³   1.13 ± 0.10 × 10⁻²   1.29 ± 0.07 × 10⁻²
Dimer Kop                        1.33 ± 0.83 × 10⁻²   7.54 ± 0.26 × 10⁻³   6.34 ± 0.80 × 10⁻³   2.59 ± 1.03 × 10⁻³
Capsid Kop                       5.12 ± 2.28 × 10⁻⁴   1.78 ± 1.56 × 10⁻³   2.90 ± 0.62 × 10⁻³   1.24 ± 0.30 × 10⁻²
Dimer opening: ΔG° (kJ mol⁻¹)    11.59                12.11                12.79                15.35
Capsid opening: ΔG° (kJ mol⁻¹)   18.40                15.68                14.77                11.31
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Figure 4. Determination of Kop and kopen from observed rate constants plotted versus trypsin concentration. (a) Over a
wide range of concentrations the data can be described by a hyperbola. Under opening-limited conditions kexp approaches
a fixed limit; these plots are fit to the full kinetic equation (equation (7)), giving Kop and kopen values. Data shown are for the
dimer at 37 °C. (b) In cleavage-limiting conditions, plots of kexp are linear with respect to enzyme, with a slope of Kop kcat/KM.
Data shown are for the capsid at 37 °C.


(kopen). Under these reaction conditions, the insensitivity
of the resulting kopen value to the kcat/KM
constant and enzyme concentration makes this an
ideal initial screen for dynamic behavior. In order to
determine Kop, reaction conditions are shifted to the
other extreme, dilute trypsin, such that the trend
between the experimental rate constant (kexp) and
enzyme concentration becomes linear ("cleavage-limited";
Figure 4(b)).
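
A minimal sketch of the cleavage-limited analysis is given below: kexp is regressed against enzyme concentration with a line through the origin, and the slope is divided by kcat/KM to estimate Kop. The data are simulated, not measured; they are generated from an assumed steady-state form of the two-state model using the capsid values at 37 °C from Table 1, and the enzyme concentrations are hypothetical.

```python
def slope_through_origin(xs, ys):
    """Least-squares slope of y = m * x (no intercept)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


# Capsid at 37 C (Table 1); K_CLOSE is derived from K_OPEN and K_OP_TRUE.
KCAT_OVER_KM = 12.5e6   # min^-1 M^-1
K_OPEN = 1.29e-2        # min^-1
K_OP_TRUE = 1.24e-2
K_CLOSE = K_OPEN / K_OP_TRUE


def simulated_k_exp(enzyme_molar):
    """Simulated k_exp from an assumed steady-state two-state expression."""
    k_cleave = KCAT_OVER_KM * enzyme_molar
    return K_OPEN * k_cleave / (K_OPEN + K_CLOSE + k_cleave)


# Hypothetical dilute trypsin concentrations (M), chosen so k_cleave << k_close.
enzyme_molar = [1e-10, 2e-10, 3e-10, 4e-10, 5e-10]
k_exp_values = [simulated_k_exp(e) for e in enzyme_molar]

slope = slope_through_origin(enzyme_molar, k_exp_values)
k_op_estimate = slope / KCAT_OVER_KM
print(f"slope = {slope:.3e} min^-1 M^-1   estimated K_op = {k_op_estimate:.4f}")
```

In this simulation the linear estimate recovers Kop to within a few percent; the residual bias comes from the small but non-zero open-state population and from kcleave not being fully negligible relative to kclose.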

Unfolding  rates  and  equilibria 

Though  kexp  varies  between  dimer  and  capsid  by 
more  than  20-fold,  kopen  is  nearly  identical  between 
the  two  forms  across  the  tested  temperature  range. 
The temperature dependence of kinetic parameters
shows a steady increase in the rate of the opening
transition between 19 °C and 37 °C for both capsid
and dimer forms of Cp149 (Figure 5(a)). In order to
fully characterize Kop and kopen values for dimer and
capsid forms of Cp149, digestion curves (Figure
4(a)) were generated for temperatures between
19 °C and 37 °C. When feasible, strict cleavage-limited
reactions were also carried out to obtain a
greater degree of precision (Figure 4(b)).

The  uniform  behavior  between  dimer  and  capsid 
with  respect  to  opening  rate  is  in  contrast  to  the 
temperature  dependence  of  the  opening  equilibrium: 
the  equilibrium  between  open  and  closed  states 
follows  opposite  trends  for  capsid  and  dimer.  For 
capsid,  Kop  increases  from  19  °C  to  37  °C  (the  amount 
of  open  state  changing  from  0.05%  to  1.24%;  Figure 
5(b)).  In  the  dimer,  at  low  temperature  (19  °C)  the 
region  surrounding  residues  127/128  is  relatively 
exposed  to  enzymatic  cleavage,  with  approximately 
1.1% of the dimers existing in the open conformation.
The  equilibrium  for  dimer  shifts  toward  the  closed 
conformation  with  increasing  temperature,  reaching 
0.259%  at  37  °C.  Since  the  opening  rates  for  both 
capsid and dimer are nearly identical, these differences
in exposure equilibrium can be attributed to
the  closing  rate  for  the  C-terminal  region. 
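
The percentages quoted above, and the ΔG° entries in Table 1, can be related to Kop through two standard relations: the open fraction is Kop / (1 + Kop), and ΔG° = -RT ln Kop. The sketch below applies both to the 37 °C values; the function names are hypothetical and R is taken as 8.314 J K⁻¹ mol⁻¹.

```python
import math

R = 8.314  # gas constant, J K^-1 mol^-1


def fraction_open(k_op):
    """Fraction of molecules in the open state for Closed <=> Open."""
    return k_op / (1.0 + k_op)


def delta_g_opening_kj(k_op, temp_celsius):
    """Standard free energy of opening in kJ/mol, from dG = -RT ln K."""
    temp_kelvin = temp_celsius + 273.15
    return -R * temp_kelvin * math.log(k_op) / 1000.0


# K_op values at 37 C from Table 1.
for label, k_op in (("capsid", 1.24e-2), ("dimer", 2.59e-3)):
    print(f"{label}: open fraction = {100 * fraction_open(k_op):.2f}%, "
          f"dG(opening) = {delta_g_opening_kj(k_op, 37):.2f} kJ/mol")
```

For small Kop the open fraction is nearly equal to Kop itself, which is why the text quotes the equilibrium constants directly as percentages.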


Figure 5. Kinetic and thermodynamic constants of Cp149 motion. (a) The rate of de-protection does not differ
substantially between dimer and capsid forms of HBV. For both dimer and capsid a gradual increase in the rate of
transition between protected and exposed states can be seen with increasing temperature. (b) Equilibrium between
protected and cleavable states changes with temperature. Measured Kop(kcat/KM) values are corrected for changes in
intrinsic enzymatic activity for each temperature (Table 1), giving the equilibrium constant for de-protection. The capsid
(open  red  circles,  continuous  line)  exhibits  the  expected  behavior  with  increasing  exposure  to  enzymatic  cleavage  as  the 
temperature  increases.  Unassembled  dimer  (filled  blue  squares,  broken  line)  is  seen  to  be  most  exposed  to  proteolysis  at 
reduced  temperatures,  while  increasing  the  temperature  causes  a  shift  in  conformation  to  a  more  tightly  folded  state. 
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Thermodynamics 

The equilibrium constant for the open-close transition,
Kop, shows different behavior for dimer and
capsid. Comparing dimer and capsid, values of Kop
at a particular temperature differ by up to two
orders of magnitude, and Kop trends between the
two assembly states have opposite temperature
dependence (Figure 5(b)). Van't Hoff plots of the
temperature dependence of the closing transition
give an enthalpy of -125.9 kJ mol⁻¹ and an entropic
contribution of -368 J K⁻¹ mol⁻¹ for the closing process
in capsids. For dimer, Kop decreases with increasing
temperature, indicating entropic stabilization
of the folded state with ΔH = +54.5 kJ mol⁻¹ and
ΔS = +224 J K⁻¹ mol⁻¹ for closing. These values for
dimer have the same trends as parameters calculated
from capsid assembly experiments2 ' and differ
by less than an order of magnitude, suggesting a
connection between folding of the C terminus and
the capsid assembly reaction. Based upon the sign
changes in enthalpy and entropy of the closing
transition for dimer and capsid, the C terminus of
Cp149 has fundamentally different behavior in the
two assembly states.
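
The opposite temperature trends can be illustrated by inserting the van't Hoff parameters into ΔG° = ΔH - TΔS. In the sketch below the closing values quoted above are sign-reversed to describe opening, and Kop is then computed as exp(-ΔG°/RT). This assumes ΔH and ΔS are constant over 19-37 °C, which is implicit in a linear van't Hoff analysis; the function names are hypothetical.

```python
import math

R = 8.314  # gas constant, J K^-1 mol^-1


def delta_g_opening_kj(dh_close_kj, ds_close_j, temp_celsius):
    """Opening free energy (kJ/mol) from closing enthalpy and entropy."""
    temp_kelvin = temp_celsius + 273.15
    dg_close_j = dh_close_kj * 1000.0 - temp_kelvin * ds_close_j
    return -dg_close_j / 1000.0  # opening is the reverse of closing


def k_op(dh_close_kj, ds_close_j, temp_celsius):
    """Opening equilibrium constant implied by the van't Hoff parameters."""
    temp_kelvin = temp_celsius + 273.15
    dg_open_j = delta_g_opening_kj(dh_close_kj, ds_close_j, temp_celsius) * 1000.0
    return math.exp(-dg_open_j / (R * temp_kelvin))


CAPSID = (-125.9, -368.0)  # closing dH (kJ/mol), dS (J/K/mol), from the text
DIMER = (54.5, 224.0)

for label, (dh, ds) in (("capsid", CAPSID), ("dimer", DIMER)):
    for temp in (19, 37):
        print(f"{label} {temp} C: dG(opening) = "
              f"{delta_g_opening_kj(dh, ds, temp):.2f} kJ/mol, "
              f"K_op = {k_op(dh, ds, temp):.2e}")
```

With these inputs the capsid Kop rises with temperature while the dimer Kop falls, and the dimer value at 19 °C comes out near 1.1 × 10⁻², in line with the open fraction quoted for that condition.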

These results show that dimer and capsid forms
of Cp149, or the surrounding microenvironments,
have heterogeneous dynamic behavior. At
37 °C, roughly 1% of capsid protein is in the open
conformation, while dimer is almost entirely closed.
At 19 °C, the properties are reversed. There is also
no correlation between the folded state of the C
terminus and the final assembly condition of
Cp149: the open structural state observed for
Cp149 can be achieved by both dimer and capsid
under the appropriate conditions, while the primary
difference between dimer and capsid is the
opposite temperature dependence of the opening
equilibrium.

Discussion 

To date, kinetic analysis of hydrolysis has been
largely relegated to the study of model systems and
small monomeric proteins. We have shown here that
the energetics behind conformational flexibility,
even within supramolecular complexes, can be
measured using kinetic hydrolysis. Besides identification
of the cleavage site using mass spectrometry,
only standard lab equipment is required for these
experiments. Preliminary SDS-PAGE and mass
spectrometry of HBV Cp149 reaction products
showed that proteolysis in both dimer and capsid
forms exclusively targets the trypsin site R127/T128.
Because the initial proteolytic reaction is focused on
a single site, the kinetic analysis is practical and
unambiguous. Computational modeling of proteolytic
reactions indicates that even a minor secondary
cleavage pathway can have a significant impact on
the calculated equilibrium and rates.63,64 These
concerns make it vital that measurements of dynamic
motion based on the disappearance of starting
material be the result of a single cleavage pathway.
Previously this requirement has been met by limiting
the application of kinetic hydrolysis to small
proteins that are well folded.61,62,65 Supramolecular
complexes can be amenable to the same analyses,
provided suitable precautions are taken.

The application of kinetic hydrolysis to supramolecular
complexes introduces a series of structural
concerns. It is possible that structural rearrangements
such as dimer dissociation, loss of capsid integrity,
or destabilizing cleavages could all affect the
measured rates, but these factors were found to be
absent  under  the  conditions  studied.  Capsids  are 
very  robust  and  no  detectable  dissociation  has  been 
observed  with  the  buffer  conditions  used  in  these 
experiments.23  At  37  °C,  dimer  is  seen  to  have  a 
reduced  Kop  value  compared  to  capsid,  ruling  out 
dimer  dissociation  as  an  intermediate  in  the  reaction 
pathway  of  capsid  proteolysis.  Dynamic  light 
scattering  and  size-exclusion  chromatography 
experiments  show  no  general  capsid  destabilization 
during  the  course  of  the  reactions,  and  by  collecting 
data  only  on  solutions  with  greater  than  70% 
completely  intact  protein,  destabilization  is  further 
minimized.  Localized  effects  as  a  result  of  cleavage 
within  the  clusters  of  five  or  six  C  termini  at  the 
capsid  pores  cannot  be  ruled  out,  but  the  loss  of  the 
cleavage  peptide  at  the  terminus  is  not  expected  to 
reduce  steric  protection  for  neighboring  dimers. 
Besides  accessibility,  the  cleavage  site  stability  is 
maintained as the reaction progresses. The stabilizing
hydrophobic cluster consisting of amino acids
L140,  F110,  and  Y118  is  formed  within  one  dimer, 
and  by  restricting  data  collection  to  only  30% 
completion,  there  is  less  than  10%  likelihood  of  a 
second  cleavage  occurring  adjacent  to  an  already 
cleaved  site. 

We find that HBV capsids and core protein dimers
are very dynamic compared to the small protein
examples known. The exposure of the R127/T128
cleavage site is not a rare process, with equilibrium
values ranging between 5 × 10⁻⁴ and 0.0124. Similar
studies of other proteins have found values more
than 200-fold lower.30,61,66 It may also be possible
that the magnitude of motion required for transition
between the folded and protease-accessible states is
far greater than that required for biological activity
in the native particles, resulting in effectively higher
exposure for native interactions. While the measured
Kop equilibrium is larger than values observed
for other systems, kopen trends are relatively similar.30
This indicates that the high exposure distribution
of the C terminus is maintained by a very slow
closing rate. In a biological context, these parameters
give an average of three individual monomers
per capsid in the open state. Because all
reactions are carried out in solution and represent
the average for ~2 × 10¹² individual capsids, it is
impossible to say whether the opening transition is
independent or cooperative in the context of a
capsid. It is attractive to speculate that several
adjacent subunits around a 5-fold or quasi-6-fold
vertex open at the same time.

Similar to the equilibrium trends, the exposure
rates of the cleavage site indicate the dynamic nature
of Cp149. The slowest opening transition to a
cleavage-accessible state is found for capsid at
19 °C, with a rate constant of 3.5 day⁻¹. While this
is comparable to values seen for sets of stable amide
protons in HDX experiments/1 the motions
required for the "open" state in hydrogen-exchange
are extremely small, sometimes less than an angstrom
unit.52 In contrast, proteolysis requires a
substantial region of unfolded backbone,4** and in
the case of Cp149, steric constraints from the four-helix
turrets increase the required exposure for
cleavage. The total translation, between the conformation
in the crystal structure to an estimated
cleavage-accessible unfolded state, may necessitate
as much as a 13 Å motion for the R127/T128 pair
alone. This large-amplitude motion is sampled on a
moderate timescale for even the most strongly
folded conditions studied (19 °C, capsid).

The Cp149 C terminus likely populates an
ensemble of conformations in solution. The application
of kinetic hydrolysis can only resolve the
rate of transition across an arbitrary boundary
between protease-protected and accessible conformations.
A two-state approximation has been a
sufficient and successful model for kinetic hydrolysis
studies of other proteins.60,62,65,70,71 Our
results reaffirm applicability of a two-state closed/open
system to HBV Cp149 dimer and capsid.
While there may be subtle complications beyond
the two-state behavior, any deviations present are
insignificant within the error of the data. The
kinetics follow the predicted pattern for a partially
protected peptide backbone: cleavage rates of
the dimer are approximately 150-fold slower compared
to the unstructured model peptide. However,
the observed opening rates (Figure 5(a)) far
exceed predicted values for non-dynamic protein
structures.  2  The observed rate of digestion also
reaches an upper limit in the opening-limited conditions
of high enzyme concentration (Figure 4(a)),
indicating that transient interconversion is responsible
for the protection effect. This phenomenon
has been observed with other known two-state
systems.50,70

Structures containing atomic-level detail of the C
terminus of Cp149 in capsid form, crystallized at
~21 °C to 24 °C, are available.13,14 These models are
in good agreement that residues G112-T142 adopt a
defined, folded helix-loop-arm conformation, with
L140 wrapping around to meet with residues F110
and Y118 (see Figures 1(b) and 6(a) and (c)). The site
of cleavage, R127/T128, is located near the terminus
of helix 5 in this helix-loop-arm. The relative
size  of  the  protease  is  shown  next  to  the  capsid  in 
Figure  1.  Rigid  body  docking  of  the  crystal 
structures  indicates  that  substantial  unfolding,  as 
described  above,  must  take  place  to  allow  the 
cleavage  site  to  reach  the  enzyme  active  site.  In  this 
case,  the  static  models  from  crystallography  and 
cryo-electron  microscopy  are  silent  to  the  observed 
dynamics. 


Figure 6. Solution phase dynamics localize to a well
ordered helix in the structural model. The structure of
HBV Cp149 is shown in dimer and capsid forms ((a), and
(b) and (c), respectively; not to scale). (a) Dimer: the
trypsin cleavage site between residues 127 and 128 is
indicated with arrows. Despite the presence of alternative
protease sites for trypsin and thermolysin (shown in red),
cleavage is restricted to the end of helix 5 (in green). (b)
Local overhead view of a quasi-6-fold center of symmetry
within the assembled capsid. Shown as a solvent-excluded
surface, the cleavage sites (green) are barely exposed to
solvent. The observed cleavage location (green) is central
to the packing interface between dimers at the 5-fold and
6-fold axes of symmetry in the capsid. The indicated
boundary shows the extent of Figure 6(c). (c) Cutaway of
Figure 6(b), rotated 90° away from the viewer. Dimers
make close contact centered at helix 5 (red chain),
preventing access to the cleaved bond (green spheres).

Based on the differences between structural data
and our dynamics results, we postulate that free
dimer adopts a conformation where the C-terminal
domain is loosely held in place by hydrophobic
interactions in a manner that is distinct from capsid.
In capsids, the cleavage site is dynamic but leaves
most hydrophobic contacts intact during breathing
modes, which results in the change of sign for the
thermodynamic parameters. This model is also
consistent with previous studies that have detailed
the thermodynamics of capsid formation.2' The
temperature dependence of dimer opening also
clarifies the role of small molecules such as HAP
and Zn2+ in the allosteric regulation of assembly.14,21'2
At 21 °C Zn2+ induces trapped assembly
while at 37 °C it leads to virus-like particles. The
HAP molecules strongly enhance assembly and
have been shown to pack against the C-terminal
helix and arm.14 In the assembled capsid the
C-terminal arm is tightly packed into the inter-dimer
hydrophobic interfaces.14 14 Our results indicate
that the assembled HBV capsid can be described
as a thermodynamic balancing act between
the inter-subunit affinity, which promotes assembly,
and localized opening, which allows the C-terminal
domain to be dynamic.

The wealth of data available on the behavior of
the HBV capsid allows these results to be put into
context with respect to the biology of the virus. In
vivo, the dynamic behavior of the C terminus plays
a general role in the lifecycle of HBV.  2  The
construct used for these experiments, with deletion
of the C-terminal arginine-rich region, may be
thought of as the minimal structure which still
retains the features of the native protein.74 By using
this variant, which is well studied from the
perspective of both assembly and structure, the
kinetic hydrolysis measurements can be compared
to data that are unavailable for the full-length
capsid protein. However, the full-length protein
also has behavior which is in agreement with our
observations of Cp149. The RNA binding domain,
the C terminus (residues 150-183) of the full-length
core protein, is located on the interior of the capsid
and directly exposed to the nucleic acid. However,
the phosphorylation state is altered during assembly,
maturation,32'4741,42 74,74 and nuclear localization
of assembled particles.33,35,36,76 Transient
exposure of this domain will allow reaction with
kinases, phosphatases, and the nuclear transport
machinery. Secretion of mature cores from the cell
must take advantage of other host transport
machinery, probably including the ubiquitin ligase,
Nedd4.3s However, Nedd4 is predicted to bind to a
late domain PPAY (residues 129-132; Figure 1(c))
that is largely buried. 4S Our study indicates that
PPAY, which is adjacent to the cleavage site, will be
transiently exposed. We can reasonably hypothesize
that the nucleic acid content of the core will affect
the breathing modes that expose the RNA-binding
and late domain sequences, providing a mechanism
for DNA synthesis to signal the next steps in the
virus life cycle.


Conclusion 

The use of proteolytic digestion to probe dynamic
behavior of proteins is well established and provides
an alternative approach to spectroscopic means for
obtaining thermodynamic information.45,50,61,62
However, the advantages it provides (simplicity
despite structural size and complexity; amenability
to varying buffer, pH, and temperature conditions;
and sensitivity to large-scale motion) have not been
fully explored. We have demonstrated the utility of a
quantitative kinetic proteolysis approach to study
the 4 MDa hepatitis B viral capsid. Our findings
indicate that the C terminus is highly dynamic,
resulting in transient exposure to the surface of the
capsid. Similarly internalized domains in other
icosahedral capsids are known to be dynamic and
exposed in solution.7,49,55,77,78 The detailed characterization
of this motion for the HBV capsid has
shown that the dynamics are highly temperature
dependent. In dimer, the opening transition appears
to have negative entropy and positive enthalpy,
while in capsid the reverse is true. This thermodynamic
switch between oligomeric states may help
explain the variety of roles observed for the C
terminus, where structural localization is predicted
to vary in the course of the HBV lifecycle. The
approximate match between energetics of C terminus
"closing" in the dimer and capsid assembly may
indicate that folding in this region is part of the
pathway for capsid assembly, although further
investigation is needed to clarify the role of any
intermediates. The measurements of kinetic and
thermodynamic parameters made possible with
kinetic hydrolysis provide the first mechanistic
evidence for dynamic structural behavior within
the highly biologically relevant region of the hepatitis
B core protein.

Materials  and  Methods 

HBV  protein  preparation 

HBV capsids were produced from recombinant Cp149 2/
expressed in BL21 DE3 Escherichia coli and purified with
multiple cycles of size-exclusion chromatography as
previously described.  q  Capsids were stored in 50 mM
Hepes (pH 7.5), 0.5 M NaCl, 2 mM DTT at 4 °C. Purified
capsids were dissociated with 3 M urea on ice and isolated
with size-exclusion chromatography to produce a dimer
stock. Storage conditions for the dimer were 50 mM
Na2CO3 (pH 9.6), 2 mM DTT at 4 °C. The stock solutions of
Cp149 were regularly checked using FPLC size-exclusion
chromatography (100 cm Sephacryl 500; GE Healthcare
Inc.; Chalfont St. Giles, GBR) with detection at 280 nm and
dynamic light scattering (Brookhaven 90Plus; Brookhaven
Instruments Corp.; Holtsville, NY).

Immediately before use, samples were diluted into
reaction buffer. Although previous research has shown
the robust stability of Cp149 dimers and capsid under the
buffer conditions used,1 example reactions were run
within a temperature-controlled Brookhaven 90Plus
dynamic light scattering (DLS) instrument to monitor the
assembly state during the progress of digestion. DLS was
measured at 90° using a 661 nm diode laser, and correlation
functions were fit using a non-negatively constrained
least-squares analysis. As the proteolysis progressed, no
decrease in the primary signal was observed, nor were
there any smaller particles or aggregate material seen.
Capsid integrity was also evaluated by size-exclusion
chromatography with a Superose 6 column (Amersham-Pharmacia,
Piscataway, NJ). Buffer conditions were identical
to those used for proteolysis reactions (see below), and
detection was followed at 280 nm. The experimental
results provided additional evidence for the continuing
integrity of the capsids (see Results).

Proteolysis  reactions 

Reaction solutions consisted of 0.5 mg/ml of HBV
protein, 150 mM NaCl, 100 mM Tris (pH 7.5), and variable
concentrations of sequencing-grade modified trypsin
(Promega Corp.; Fitchburg, WI). All buffers were individually
prepared to account for the temperature sensitivity
of Tris. Trypsin was stored in frozen aliquots in 1 mM HCl;
any dilutions were made immediately before use. Each
reaction was initiated by addition of an appropriate
volume of trypsin and vortexed. A typical experiment
consisted of 50-100 µl final volume incubated in a
thermocycler for the duration of the digestion. Aliquots
of 3 µl were removed in triplicate from the reaction and
immediately diluted into 12 µl of gel loading buffer,
vortexed, and heated to 100 °C for 5 min. Following
boiling, samples were cooled on ice, briefly centrifuged to
collect condensation, vortexed, and stored on ice until use.
Prior to gel loading, the frozen samples were simultaneously
boiled at 100 °C for 5 min, cooled to 25 °C,
centrifuged, vortexed, and maintained at 25 °C during gel
loading. These additional precautions for sample preparation
were used to maximize reproducibility in gel loading.
Samples were run on 4%-20% (w/v) gradient and 18%
non-gradient SDS-polyacrylamide gels (Bio-Rad, Hercules,
CA) with Tris-glycine running buffer. Running
conditions were 200 V and the minimal time required to
ensure separation between intact protein and cleaved
products, typically 25 min. Gels were prepared according
to manufacturer's instructions for staining with colloidal
Coomassie or Sypro Ruby (Pierce Biotechnology, Rockford,
IL), and scanned with a Typhoon 9410 laser scanner
(GE Healthcare Inc.) for Sypro stain and Epson 4180
scanner (Seiko Epson Co, Suwa, JPN) for Coomassie stain.

Preliminary  experiments  were  conducted  to  ensure  that 
the  subset  of  the  hydrolysis  progression  assayed  for 
quantitative  measurements  was  representative  of  the 
overall  trend  and  did  not  span  any  discontinuities  (Figure 
2(a)  and  (c)).  To  ensure  the  integrity  of  the  viral  capsids,  for 
quantitative  reactions  all  data  were  collected  from  time 
points  containing  greater  than  70%  uncleaved  protein,  as 
determined  by  gel  densitometry.  For  each  enzyme 
concentration,  sampling  times  were  adjusted  as  necessary 
for  best  coverage  of  this  region.  Samples  were  removed 
from  reactions  either  as  individual  time  stamps  (Figure 
2(a)  and  (c))  or  in  triplicate  at  each  time  point  (Figure  2(b) 
and  (d)).  The  latter  approach  allowed  an  evaluation  of 
sampling  and  quantitation  error. 

Densitometry 

While it is necessary to limit the extent of digestion
when working with a protein complex to ensure integrity,
this complicates densitometry because the observed
differences in band intensities are limited and subtle. By
restricting proteolytic cleavage to less than 30% completion,
the change in intact protein intensity represents a
very small variable range for typical densitometry. In
order to maximize the accuracy of the quantitation, several
novel methods were employed. Image acquisition is the
first step of the processing pipeline which can have an
impact on the final result: concerns typically include
linearity, sensitivity, and dynamic range.80 However, for
quantitation of band intensities that never vary by more
than ±15%, these effects are minimal. In order to test this
assumption, gels were stained with the standard
fluorescent stain SYPRO Ruby and image acquisition was
performed using a Typhoon 9400 laser scanner. Gels were
then stained with the colloidal Coomassie stain and
imaged with an Epson 4180 flatbed CCD scanner. For
the types of gel images used, comparative analysis
showed no significant deviation between the two methods.
Since acquisition introduces minimal error for
homogeneous images, detection and quantitation of gel
bands was addressed as the primary source of variability.
Image manipulation was performed using lossless operations
in Adobe Photoshop CS, followed by analysis using
ImageJ (Rasband, U.S. National Institutes of Health†) with a
manual watershed isolation method and a high-resolution
modified version of the GelAnalyzer module. Band
intensities were plotted and fit in Origin 7.0 to the first-order
equation:

$$I_t = I_0 \, e^{-k_{exp} t} \tag{2}$$

where $I_t$ is the measured intact protein at time t, $I_0$ is the
intensity at t = 0, and $k_{exp}$ is a first-order rate constant.
These  precautions  made  it  possible  to  quantify  band 
intensities  to  an  acceptable  degree  of  precision:  the  total 
variability  for  replicate  lanes  was  typically  less  than  4% 
relative  standard  deviation. 
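
As an illustration of the first-order fit in equation (2), the sketch below recovers I0 and kexp from a time course of band intensities. The fits reported here were done in Origin 7.0; this version substitutes a log-linear least-squares fit in plain Python, which is a simplification and not the authors' procedure. The function name `fit_first_order` and the synthetic data are hypothetical.

```python
import math


def fit_first_order(times, intensities):
    """Fit I_t = I_0 * exp(-k_exp * t) by linear least squares on ln(I_t).

    Returns (I_0, k_exp). Assumes all intensities are positive.
    """
    if len(times) != len(intensities) or len(times) < 2:
        raise ValueError("need at least two (time, intensity) pairs")
    log_i = [math.log(value) for value in intensities]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(log_i) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, log_i))
    slope = sxy / sxx                      # equals -k_exp
    intercept = y_mean - slope * t_mean    # equals ln(I_0)
    return math.exp(intercept), -slope


if __name__ == "__main__":
    # Synthetic, noise-free example: I_0 = 100, k_exp = 0.01 per minute,
    # sampled only while more than 70% of the protein remains intact.
    times = [0, 5, 10, 15, 20, 25, 30]
    intensities = [100.0 * math.exp(-0.01 * t) for t in times]
    i0, k_exp = fit_first_order(times, intensities)
    print(f"I_0 = {i0:.3f}, k_exp = {k_exp:.5f} per minute")
```

A log-linear fit weights the points differently from a direct non-linear fit, so with noisy data the two approaches can give slightly different rate constants.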

Mass  spectrometry 

Proteolysis reactions for mass spectrometry analysis
were conducted identically to those for gel-based analysis
except for termination; in some situations one reaction was
simultaneously analyzed with gels and mass spectrometry.
Quenching of proteolysis was performed via
multiple methods to avoid biasing peptide detection.
These included dropping the pH, addition of irreversible
trypsin inhibitors such as PMSF, boiling, spotting directly
on to a MALDI plate, and direct LCMS injection. For
electrospray-based MS analysis sample quantity was
varied over four orders of magnitude spanning the typical
detection range. Electrospray instrumentation included an
Agilent XCT Plus (Agilent Technologies, Santa Clara, CA),
Bruker micrOTOF LC (Bruker Daltonik GmbH, Bremen,
DEU), and Waters QToF Premier (Waters Corp., Milford,
MA), all interfaced with reverse-phase HPLC (C8 or C18;
Phenomenex Inc., Torrance, CA). MALDI analysis was
performed with a variety of sample/matrix ratios and
both sinapinic acid and α-cyano-4-hydroxycinnamic acid,
on a Bruker BiFlex III.

† http://rsb.info.nih.gov/ij/

Intrinsic rate determination of trypsin cleavage

To determine the activity of trypsin upon the observed
cleavage sequence in the absence of any structural or steric
constraints, the fluorogenic peptide Abz-IRTPP-Tyr(3-NO2)
was characterized as a trypsin substrate. While the
peptide is intact, the FRET fluorescence-quenching
nitrotyrosine residue decreases intrinsic fluorescence of
the 2-aminobenzamide (Abz) at the C terminus. Upon
cleavage at the R/T trypsin site, the Abz fluorescence is
restored.81 The synthetic peptide was purchased from New
England Peptide (Gardner, MA) at >99% purity and used
without further purification. Assays were carried out for
each temperature under the same conditions as for
proteolytic reactions: 150 mM NaCl and 100 mM Tris
(pH 7.5). Peptide concentration was varied from 150 nM
to 75 nM in order to generate KM curves. Buffered
solutions containing the peptide were pre-incubated in a
temperature-controlled and stirred cuvette with a PerkinElmer
LS50B fluorometer (PerkinElmer, Waltham, MA)
and reactions were initiated by addition of trypsin to a
final concentration of 9.6 nM to 385 nM. Changes in Abz
fluorescence were observed with an excitation wavelength
of 318 nm and an emission wavelength of 420 nm.
Depending on trypsin concentration, analysis was carried
out by fitting data to either initial-velocity Michaelis-Menten
kinetic models or with an exponential-rise
model:82


$$F_t = F_0 + (F_{max} - F_0)\left(1 - e^{-(V/K)\,t}\right) \tag{3}$$

where $F_t$ is the measured fluorescence, $F_0$ is the fluorescence
at t = 0, $F_{max}$ is the fluorescence at t = ∞, and V/K is a
pseudo-first-order rate constant. From this exponential-rise
model kcat/KM was determined by dividing V/K by
the enzyme concentration of the reaction.
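
The exponential-rise model of equation (3) and the conversion of V/K to kcat/KM can be sketched as follows. This is only an illustration: the function names and the numerical values are hypothetical and are not measurements from this work. It assumes V/K is expressed in s⁻¹ and the enzyme concentration in M.

```python
import math


def fluorescence(t, f0, fmax, v_over_k):
    """Equation (3): exponential rise from F_0 toward F_max with rate V/K."""
    return f0 + (fmax - f0) * (1.0 - math.exp(-v_over_k * t))


def kcat_over_km(v_over_k, enzyme_conc):
    """Second-order constant kcat/KM from the pseudo-first-order V/K.

    v_over_k in s^-1 and enzyme_conc in M give kcat/KM in M^-1 s^-1.
    """
    if enzyme_conc <= 0:
        raise ValueError("enzyme concentration must be positive")
    return v_over_k / enzyme_conc


if __name__ == "__main__":
    # Hypothetical values chosen only to show the arithmetic.
    v_over_k = 0.0385          # s^-1
    enzyme = 385e-9            # M (upper end of the trypsin range used)
    print(f"kcat/KM = {kcat_over_km(v_over_k, enzyme):.3g} M^-1 s^-1")
    for t in (0, 10, 60, 600):
        print(t, round(fluorescence(t, 10.0, 110.0, v_over_k), 2))
```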


Kinetic  models  of  proteolysis 


For protein domains that partition between folded and
unfolded conformations, the rate of cleavage can be
modeled with a two-state equilibrium system:

$$\text{Closed} \underset{k_{close}}{\overset{k_{open}}{\rightleftharpoons}} \text{Open} \xrightarrow{k_{cleave}} \text{Cleaved} \tag{1}$$

Under normal conditions without the presence of protease,
an equilibrium is established between open and
closed forms of the protein, giving:

$$K_{op} = \frac{[\text{Open}]}{[\text{Closed}]} = \frac{k_{open}}{k_{close}} \tag{4}$$

and with [Closed] + [Open] = [Total]:

$$[\text{Open}] = \frac{K_{op}}{1 + K_{op}} \cdot [\text{Total}] \tag{5}$$

For typical cases with small values of Kop, equation (5)
simplifies to [Open] = Kop [Total], removing the denominator
term. This simplification can be propagated through
derivative equations without excessive error. In the
presence of a proteolytic enzyme, the intrinsic cleavage
rate constant kcleave can be calculated as:

$$k_{cleave} = \frac{k_{cat}}{K_M} \cdot [E] \tag{6}$$

with kcat/KM being the standard catalytic constants for
trypsin cleavage of the particular peptide sequence and [E]
the concentration of trypsin. The pseudo-first-order
simplification of the Michaelis-Menten equation is justified
because the measured KM value for this sequence was
20 µM, which is much greater than the concentration of
available HBV cleavage sites. A steady-state simplification
of the system in equation (1) can be used to approximate
the observed first-order rate constant kexp as a function of
the component constants:

$$k_{exp} = \frac{k_{open}\,k_{cleave}}{k_{close} + k_{cleave}} \tag{7}$$

This produces a hyperbolic trend when sufficient
quantities of enzyme are used in the digestions. When
(kcat/KM)[E] ≫ kclose, any protein in the open conformation
is immediately cleaved and kexp approaches a
maximum limit. This is termed "opening-limited conditions",
and under such situations kexp = kopen. Since in
practice it is rarely possible to achieve perfectly
enzyme-saturated levels, kopen is typically extrapolated
by fitting equation (7) with non-linear regression to data
collected across a wide range of enzyme concentrations
(Figure 3(a)).
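
The two limiting regimes of equation (7) can be checked numerically with a short sketch. The rate constants below are arbitrary illustrative numbers, not values measured in this study, and the function names are hypothetical.

```python
def k_cleave(kcat_over_km, enzyme_conc):
    """Equation (6): intrinsic cleavage rate constant (kcat/KM) * [E]."""
    return kcat_over_km * enzyme_conc


def k_exp(k_open, k_close, k_cleave_value):
    """Equation (7): steady-state observed first-order rate constant."""
    return k_open * k_cleave_value / (k_close + k_cleave_value)


if __name__ == "__main__":
    # Arbitrary illustrative constants (same time unit throughout).
    k_open, k_close = 1.0, 100.0
    kcat_km = 1.0e5  # per M per unit time, hypothetical
    for enzyme in (1e-9, 1e-6, 1e-3, 1.0):
        kc = k_cleave(kcat_km, enzyme)
        print(f"[E] = {enzyme:g} M  k_cleave = {kc:g}  k_exp = {k_exp(k_open, k_close, kc):.6g}")
    # At low [E], k_exp is close to (k_open / k_close) * k_cleave;
    # at high [E], k_exp levels off at k_open (opening-limited).
```

Plotting kexp against [E] from such a calculation gives the hyperbola described above, with the plateau equal to kopen.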

Alternatively, when enzyme concentrations are low
enough that cleavage is slow relative to the equilibrium
process ((kcat/KM)[E] ≪ kclose), the concentration of protein
in the open conformation remains a function of the
remaining total uncleaved protein (equation (5)). By
combining equations (5) and (6), the linear relationship
between enzyme and experimental rate constant:

$$k_{exp} = \frac{k_{cat}}{K_M} \cdot [E] \cdot K_{op} \tag{8}$$

can be modeled for reactions with low concentration of
enzyme. This is termed "cleavage-limited conditions",
and data from such reactions can be fit with a linear
regression (Figure 3(b)).
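
A brief sketch of how a Kop value follows from the slope in equation (8), and how it translates into an open fraction (equation (5)) and a free energy through the standard relation ΔG = -RT ln(Kop). Several things here are assumptions made for illustration: the slope and kcat/KM inputs are hypothetical, the capsid is taken to be T = 4 with 240 monomers, and pairing Kop = 0.0124 (the upper end of the range quoted in the Discussion) with 37 °C is only an example.

```python
import math

R_GAS = 8.314462618  # gas constant, J mol^-1 K^-1
MONOMERS_PER_CAPSID = 240  # assumption: T = 4 capsid built from 120 dimers


def k_op_from_slope(slope, kcat_over_km):
    """Equation (8): the slope of k_exp versus [E] equals (kcat/KM) * K_op."""
    return slope / kcat_over_km


def open_fraction(k_op):
    """Equation (5): fraction of uncleaved protein in the open state."""
    return k_op / (1.0 + k_op)


def delta_g_deprotection(k_op, temp_celsius):
    """Free energy difference between open and closed states, in J/mol."""
    return -R_GAS * (temp_celsius + 273.15) * math.log(k_op)


if __name__ == "__main__":
    k_op = k_op_from_slope(slope=1240.0, kcat_over_km=1.0e5)  # hypothetical inputs
    print(f"K_op = {k_op:.4f}")
    print(f"open monomers per capsid = {open_fraction(k_op) * MONOMERS_PER_CAPSID:.2f}")
    print(f"dG at 37 C = {delta_g_deprotection(k_op, 37.0) / 1000.0:.1f} kJ/mol")
```

With Kop = 0.0124 the open fraction times 240 monomers comes to roughly three, in line with the estimate of about three open monomers per capsid given in the Discussion.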

Once Kop values have been obtained, ΔGdeprotection is
calculated as -RT ln(Kop). Not to be confused with the
energetics of proteolytic cleavage, this thermodynamic
parameter is the free energy difference between the open
and closed conformation states of the protein.

Abstract A systematic in silico approach has been
employed to generate sound, experimentally validated
active-site models for galactose oxidase (GO) using a
hybrid density functional, B(38HF)P86. GO displays three
distinct oxidation states: oxidized [Cu(II)-Y•]; semireduced
[Cu(II)-Y]; and reduced [Cu(I)-Y]. Only the
[Cu(II)-Y•] and the [Cu(I)-Y] states are assumed to be
involved in the catalytic cycle, but their structures have not
yet been determined. We have developed several models
(1-7) for the [Cu(II)-Y•] state that were evaluated by
comparison of our computational results with experimental
data. An extended model system (6) that includes solvent
molecules and second coordination sphere residues (R330,
Y405, and W290) is essential to obtain an experimentally
correct electronic structure of the active site. The optimized
structure of 6 resulted in a five-coordinate Cu site with a
protein radical centered on the Tyr-Cys cofactor. We further
validated our converged model with the largest model
(7) that included additional outer-sphere residues (Q406,
H334, Y329, G513, and T580) and water molecules.
Adding these residues did not significantly affect the active
site's electronic and geometric structures. Using both 6 and
7, we explored the redox dependence of the active-site
structure. We obtained four- and three-coordinate Cu sites
for [Cu(II)-Y] and [Cu(I)-Y] states, respectively, that
corroborate well with the experimental data. The relative
energies of these states were validated by a comparison
with experimental redox potentials. Collectively, our
computational GO models well reproduce the physicochemical
characteristics of the individual states, including
their redox behaviors.

Keywords  Galactose  oxidase  •  Copper/protein  radical 
active  site  •  Computational  biomimetic  model  •  Density 
functional  theory  •  Spectroscopic  calibration  of  theory 

Introduction 

Galactose oxidase (GO) is an extracellular, monomeric
copper protein of molecular mass approximately 68 kDa,
isolated from the fungus Dactylium dendroides [1]. It is
composed of three distinct domains, organized in an
exclusively β-sheet structure with short turns. One domain
contains a two-electron redox unit with a mononuclear
copper ion and a cross-linked Tyr-Cys radical cofactor that
are responsible for its catalytic activity. GO catalyzes
the two-electron oxidation of a broad range of primary
alcohols (e.g., D-galactose, D-galactosamine, dihydroxyacetone)
to the corresponding aldehydes, concomitantly
reducing dioxygen to hydrogen peroxide [1]. GO displays
three distinct oxidation states: oxidized ([Cu(II)-Y•],
green); semireduced ([Cu(II)-Y], blue); and reduced
([Cu(I)-Y], colorless). The oxidized and the reduced states
are catalytically important in the two-electron redox reaction
during the catalytic cycle [2].

Crystal structures are available for the [Cu(II)-Y] state
at pH 4.5 and 7.0 (1.7- and 1.9-Å resolution, respectively).
Both structures suggest a distorted square-pyramidal Cu(II)
coordination geometry with an axial Y495 ligand and four
equatorial ligands: H496, H581, H2O (pH 7.0) or acetate
(pH 4.5), and the unique cross-linked Tyr-Cys cofactor
(Fig. 1) [3]. Detailed, atomic-resolution structures of both
catalytically important states ([Cu(II)-Y•] and [Cu(I)-Y])
are not yet available owing to their inherent instability [4];
however, structural characterization of these redox states is
essential to understand the molecular mechanism of GO.

Spectroscopic studies have provided considerable
insights into the copper coordination environment in GO
[2, 5]. On the basis of extended X-ray absorption fine
structure (EXAFS) studies, it was suggested that the
coordination environment of the oxidized [Cu(II)-Y•] is
similar to that of semireduced [Cu(II)-Y] state (Fig. 1) [5].
A three-coordinate Cu site is suggested for the reduced
[Cu(I)-Y] state with an average of two Cu-N bond distances
and a Cu-O bond distance of 1.95 and 1.97 Å,
respectively [6]. In contrast to the geometric structures, the
electronic absorption spectra [7] of the GO redox states
differ significantly. The [Cu(II)-Y] state shows weak
absorption bands at 450 and 620 nm, whereas the
[Cu(II)-Y•] state displays multiple, intense electronic transitions
that span the visible/near-infrared regions with maximum
intensity at 445 and 800 nm. These bands are assigned as
ligand-to-metal charge transfer (LMCT) and ligand-to-ligand
charge transfer (LLCT) transitions, respectively [4,
8]. The fully reduced [Cu(I)-Y] state shows no characteristic
signatures in the electronic spectral region owing to
the filled d orbitals of the Cu(I) ion and the nonradical form
of the Tyr-Cys cofactor.

A computational study on the catalytic cycle of GO has been
previously reported employing a semiquantitative analysis of the
experimental data and relevant model systems of GO [9]. More
recently, quantum mechanical/molecular mechanics Car-Parrinello
simulations of GO and its synthetic active-site analogs were carried
out [10] using a small computational model and hybrid density
functional theory (DFT; B3LYP) [11]. These results provided new
perspectives for developing structure-reactivity correlations for
GO. This work suggested a distorted tetragonal Cu site for the
[Cu(II)-Y•] state with Cu-O (H2O and Y495) distances of 2.6-3 Å,
longer than that observed in the EXAFS (1.96 Å) [5, 10]. Notably,
these calculations indicated that the protein radical was located on
the axial ligand Y495 rather than the equatorial Tyr-Cys cofactor
[11]. However, electron paramagnetic resonance studies have
demonstrated that the protein radical is on the cross-linked Tyr-Cys
cofactor for [Cu(II)-Y•] [12-15]. In addition, a magnetic
susceptibility study revealed that the protein radical is
antiferromagnetically coupled with the Cu(II) ion, resulting in a
diamagnetic ground state for [Cu(II)-Y•] [2]. Therefore, further
investigations are needed to reconcile the experimental findings
with the calculated electronic and geometric structures of the
active site. Moreover, there is a growing interest in designing
synthetic biomimetic analogs [16-22] based on the GO active-site
architecture for stereoselective alcohol oxidation. Such design can
be aided by gaining more insight into the structures of the
important oxidation states of GO and the factors that govern its
reactivity.

Fig. 1 Crystal structure of the active site of galactose oxidase (GO)
at 1.9 Å resolution in the [Cu(II)-Y] state including residues known
to affect catalytic activity with selected Cu-ligand distances (Protein
Data Bank ID 1GOG) [3]. The average Cu-N and Cu-O extended
X-ray absorption fine structure (EXAFS) distances for oxidized
[Cu(II)-Y•] are 1.97 and 1.96 Å, respectively [5]

We describe here a systematic in silico approach for developing
accurate active-site models of GO to study the coordination
geometries and electronic structures of all three distinct redox
states. A spectroscopically calibrated hybrid density functional
[B(38HF)P86] was employed, which has been validated for mononuclear
copper proteins [23, 24]. Our model-building strategy was designed
to evaluate the importance of each inner-shell and outer-shell
residue on the geometric and electronic structures of the active
site. Each model (Fig. 2) of the oxidized GO was rigorously
evaluated by comparing the calculated and experimental structures,
spin-density distribution, singlet-triplet energy gap, and
electronic absorption spectra. The calculated [Cu(II)-Y] and
[Cu(I)-Y] states were obtained by one-electron reduction of the
optimized [Cu(II)-Y•] and [Cu(II)-Y] states, respectively. In
addition to using the molecular orbital coefficients of the
[Cu(II)-Y•], [Cu(II)-Y], and [Cu(I)-Y] models as initial orbital
guesses for [Cu(II)-Y] (blue solid arrow), [Cu(II)-Y•] (green dotted
arrow) and [Cu(I)-Y] (black solid arrow), [Cu(II)-Y] (blue dotted
arrow), respectively, we carried out optimizations starting from the
crystal structure of the [Cu(II)-Y] form with the ionic fragments
[25] (Fig. 3). These fragments correspond to the valence-bond
description of amino acid ligands and the Cu(II) ion. These
procedures were performed to evaluate the robustness of the
optimization method and to investigate protein-strain effects.

Fig. 3 Systematic mapping of the potential energy surface of redox
states for the GO active site

Overall, our study quantitatively shows that the protein
environment has a significant influence on the properties of the
active site, consistent with previous site-directed mutagenesis
studies of second coordination sphere residues (W290H, W290F, W290G,
and Y495F) [26, 27]. It is interesting to note that our
computational models were developed without following the
site-directed mutagenesis study, instead allowing the model to
reveal the residues that are essential in determining the
active-site structure. We have already communicated the results of a
converged model for the structure of oxidized [Cu(II)-Y•] GO [28].
In this paper, we provide the full details of our model-building
strategy for the oxidized [Cu(II)-Y•], the semireduced [Cu(II)-Y],
and the reduced [Cu(I)-Y] states (Fig. 3). We further validated
these models by comparing the redox potentials for
[Cu(II)-Y•]/[Cu(II)-Y] and [Cu(II)-Y]/[Cu(I)-Y] with the
experimental values [29]. Our work lays a strong foundation for
investigating the molecular mechanism of GO. It also provides a
demonstrative example of using a systematic in silico model-building
approach for constructing structurally and functionally accurate
computational models of metalloprotein active sites.

Computational  details 

A spectroscopically calibrated, hybrid density functional
[B(38HF)P86] [24] was employed in our computational study using the
Gaussian03 package [30] on a cluster of Intel Xeon EM64T servers.
The Becke88 exchange and Perdew86 correlation functionals were used
with 38% of the total density functional exchange replaced with
Hartree-Fock exchange, giving an accurate bonding description for
mononuclear copper proteins [23]. The triple-ζ (VTZ*) [31] and
double-ζ with polarization [6-31G(d)] [32-34] Gaussian-type
all-electron basis sets were employed in all calculations, which
form a converged basis set for Cu(II)-containing systems [24, 35].

The generalized ionic fragment approach [25] was employed for model
building. First, the molecular orbitals of each residue as ionic
ligand fragments were calculated in their crystallographic positions
in the [Cu(II)-Y] form (Protein Data Bank ID 1GOG) [3]. Second,
these molecular orbitals were combined to give a specific
spin-polarized wave function for the complete active-site models.
Third, geometry optimizations were performed using spin-unrestricted
B(38HF)P86 to maintain the open-shell singlet character of the wave
function. As summarized in Fig. 2, our computational models span
from a truncated model (H496 and H581 modeled as imidazole, Y495
modeled as phenol, Y272 modeled as thiophenol and H2O) to an
extended model system with 214 atoms, which includes the α-carbon of
each residue coordinated to the Cu(II); the protein backbone between
Y495 and H496; an additional H2O (HOH294) within hydrogen-bonding
distance from both Tyr residues (approximately 3 Å in the crystal
structure); distal residues Y405, Q406, R330, W290, G513, T580,
H334, and Y329, and additional crystal water molecules.

The His residues were modeled as charge-neutral, Y495 as tyrosinate
(negatively charged), Y272 as neutral tyrosyl radical, Y405 as
protonated Tyr (neutral), R330 as protonated guanidinium (positively
charged), and all other residues as charge-neutral unless otherwise
noted. The total charges of models 1-3 were +1, 0, and −1 and those
of 4-7 were +2, +1, and 0 for [Cu(II)-Y•], [Cu(II)-Y], and
[Cu(I)-Y], respectively. For models 2-7, the α-carbons of each
residue were fixed at their crystallographically determined
positions during optimization unless otherwise noted. Geometry
optimizations were performed using the redundant coordinate system
and the Berny optimization algorithm [36]. The convergence criteria
were set to 0.002 hartree/(au or rad) and 0.01 au or radian RMS
change in energy gradients and internal coordinate displacements,
respectively. Owing to the relatively lax convergence criteria,
geometry optimizations were terminated only when stationary points
were firmly located in consecutive calculations. For larger models,
this procedure avoided trapping in a higher-energy local minimum.
Atomic spin densities were derived from Mulliken population
analysis.
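
The total charges quoted above follow from simple bookkeeping of formal fragment charges, which can be sketched as below. This is only an illustrative check, not part of the original workflow. It assumes that the Tyr-Cys cofactor is a neutral radical in the oxidized state and a tyrosinate (charge −1) in the two reduced states, and that the only charged second-sphere residue is R330 (present in models 4-7). The names `total_charge`, `REDOX_STATES`, and `with_r330` are hypothetical.

```python
# Formal charges of the fragments, following the charge assignments in the text.
# Assumption (not stated explicitly in the source): after one-electron
# reduction the Tyr-Cys cofactor is treated as a tyrosinate with charge -1.
COPPER_CHARGE = {"Cu(II)": 2, "Cu(I)": 1}
COFACTOR_CHARGE = {"radical": 0, "tyrosinate": -1}

REDOX_STATES = {
    "oxidized": ("Cu(II)", "radical"),        # [Cu(II)-Y radical]
    "semireduced": ("Cu(II)", "tyrosinate"),  # [Cu(II)-Y]
    "reduced": ("Cu(I)", "tyrosinate"),       # [Cu(I)-Y]
}


def total_charge(state: str, with_r330: bool) -> int:
    """Sum the formal fragment charges of an active-site model."""
    copper, cofactor = REDOX_STATES[state]
    charge = COPPER_CHARGE[copper] + COFACTOR_CHARGE[cofactor]
    charge += -1  # axial Y495 modeled as tyrosinate
    # His residues, water molecules, and Y405 are charge-neutral: add 0.
    if with_r330:
        charge += 1  # protonated guanidinium of R330 (models 4-7)
    return charge


if __name__ == "__main__":
    for state in REDOX_STATES:
        print(state, total_charge(state, False), total_charge(state, True))
```

Under these assumptions the sketch gives +1, 0, and −1 without R330 and +2, +1, and 0 with R330, matching the totals stated for models 1-3 and 4-7.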

The relative energies of Kohn-Sham orbitals from DFT calculations
have been shown to provide reasonable estimates of absorption bands
for copper proteins [37]. Time-dependent DFT (TD-DFT) [38]
calculations were also performed to obtain an improved description
of electronic transitions for these models. Antiferromagnetic
coupling magnitudes were determined from the energy difference
between the optimized singlet and triplet states for each model
using the spin Hamiltonian $J\,S_A S_B$
($E_T - E_S = 2J\,S_{\mathrm{Cu(II)}}\,S_{\mathrm{Tyr\bullet\text{-}Cys}}$).
Spin projection of the broken-symmetry DFT solution to obtain $E_S$
was done by the method suggested by Davidson and Clark [39].
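
As a rough illustration of how a Kohn-Sham orbital energy difference maps onto an absorption band position, the conversion can be sketched as follows. It uses only the standard relation λ = hc/ΔE. The function names and the choice of hartree as the input unit are assumptions made for this example, and the sketch does not reproduce the TD-DFT treatment used in the study.

```python
# Physical constants (CODATA values, rounded).
HC_EV_NM = 1239.841984      # h*c expressed in eV*nm
HARTREE_TO_EV = 27.211386   # 1 hartree in eV


def gap_to_wavelength_nm(e_occupied_hartree: float, e_virtual_hartree: float) -> float:
    """Estimate a band position (nm) from two Kohn-Sham orbital energies."""
    gap_ev = (e_virtual_hartree - e_occupied_hartree) * HARTREE_TO_EV
    if gap_ev <= 0.0:
        raise ValueError("virtual orbital must lie above the occupied orbital")
    return HC_EV_NM / gap_ev


def wavelength_to_wavenumber_cm1(wavelength_nm: float) -> float:
    """Convert a wavelength in nm to a wavenumber in cm^-1."""
    return 1.0e7 / wavelength_nm


if __name__ == "__main__":
    # An 800 nm band corresponds to 12500 cm^-1.
    print(wavelength_to_wavenumber_cm1(800.0))
```

Orbital energy differences neglect orbital relaxation and electron-hole interaction, which is one reason the text treats them only as estimates and complements them with TD-DFT.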

Redox potentials were calculated from the sum of the ionization
potentials and solvent reorganization energies relative to the
normal hydrogen electrode at −4.43 eV. The solvent reorganization
energies were implicitly considered by polarizable continuum
calculations using various dielectric constants and a solvent radius
of 1.385 Å (water). For the proton source in the protonated
structures, we used the hydronium ion surrounded by four explicit
water molecules. This was further embedded into a standard
water-based polarizable continuum. In order to obtain an accurate
energy for the proton source in the latter system, we used a
triple-ζ-quality basis set with both polarization and diffuse
functions (6-311+G*).
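
The referencing step described above can be sketched numerically as follows. This is a minimal illustration that assumes a one-electron couple, so that an energy in eV is numerically equal to a potential in V, and assumes that the solvated total energies of the oxidized and reduced forms already include the continuum solvation contribution. The function name and arguments are hypothetical.

```python
NHE_ABSOLUTE_EV = 4.43  # absolute potential of the normal hydrogen electrode (eV)


def redox_potential_vs_nhe(e_reduced_ev: float, e_oxidized_ev: float) -> float:
    """One-electron redox potential (V vs NHE) from solvated total energies in eV.

    The solution-phase ionization energy is E(oxidized) - E(reduced); shifting
    it by the absolute NHE potential gives the potential on the NHE scale.
    """
    ionization_ev = e_oxidized_ev - e_reduced_ev
    return ionization_ev - NHE_ABSOLUTE_EV


if __name__ == "__main__":
    # Made-up energies: a solution-phase ionization energy of 5.00 eV
    # corresponds to +0.57 V vs NHE.
    print(round(redox_potential_vs_nhe(-10.0, -5.0), 2))
```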


Results  and  discussion 

First, we demonstrate the importance of using a systematic
model-building procedure on the oxidized form of GO, in order to
capture all the significant first and second coordination sphere
effects. We selected the oxidized state of GO owing to the
availability of numerous spectroscopic measurements that are
utilized to validate our computational models. Starting from the
simplest active-site model (1), we systematically extend this to the
converged structural model (6) that reproduces most of the
experimental data. We show that a considerably larger
model  (7)  does  not  improve  the  calculated  electronic  and 
geometric  structural  data  in  comparison  with  experiment 
relative  to  model  6.  Using  models  6  and  7,  we  describe 
the  structural  changes  that  accompany  the  relevant  redox 
reactions,  and  for  6  we  also  explicitly  evaluate  the  redox 
potentials. 

Systematic  development  of  a  converged  structural 
model  for  oxidized  GO 

Model  1 

The  smallest  GO  model  (1)  in  our  computational  studies 
comprises  four  equatorial  ligands,  H496,  H581,  the  cross- 
linked  Tyr-Cys  cofactor,  and  water,  and  an  axial  ligand, 
Y495.  Both  His  residues  are  modeled  as  imidazoles 
(charge-neutral),  the  axial  Y495  as  phenolate  (negatively 
charged),  and  the  cross-linked  thioether-substituted  Tyr  as 
a  CH3S-substituted  phenyl  radical.  This  model  is  similar  to 
that  used  in  an  earlier  computational  study  [11].  The 
optimized  structure  of  oxidized  1  with  selected  Cu-ligand 
distances  is  shown  in  Fig.  4a.  Model  1  can  be  characterized 
as  a  slightly  distorted  octahedral  Cu  site  with  a  phenol 
ligand in the axial position at a distance of 2.29 Å. The sulfur
atom of the equatorial CH3S-substituted phenyl is located
axially, trans to the phenol ligand at a distance of 2.97 Å.
Two imidazoles, the substituted phenolate, and the H2O form
the equatorial plane with Cu-ligand distances
ranging from 1.91 to 2.10 Å, and with cis and trans
ligand-Cu-ligand angles of 79-100° and 165 and 179°, respectively
(Fig. 4a). The aromatic ring of the axial phenol is
rotated more than 90° from its crystallographic position,
which suggests the importance of steric constraints from
the protein backbone in positioning the aromatic ring
almost perpendicular to the equatorial plane (Fig. 1).

The electronic structure of oxidized 1 (Fig. 4b) reveals 0.74 e−
spin density located on the Cu ion, and the rest of the α-spin is
delocalized onto the equatorial ligands: both His ligands (0.06 e−
on each εN), phenolate (0.09 e−), and H2O (0.04 e−). The β-spin is
mainly confined to the axial phenol ring (Fig. 4b), which is
inconsistent with experimental findings [12-15]. Since our result
agrees with previous computational studies obtained employing a
different density functional [10, 11], the location of the spin in
this minimal model is likely determined by the computational model
and not by the selected functional or basis set.

Fig. 4 Optimized structure of model 1. a Cu-ligand distances
(angstroms) and ligand-Cu-ligand angles (degrees), b Spin-density
plot (isosurface of ±0.003) and fragment spin densities, and c
Kohn-Sham molecular orbital energies (top) and the highest occupied
molecular orbital (HOMO) and the lowest unoccupied molecular
orbital (LUMO) (isosurface of ±0.05) of both α and β manifolds
(bottom)

We have extensively analyzed various initial triplet and singlet
state structures of 1 by using a generalized ionic fragment approach
[25] with unpaired electrons located at different fragments, such as
Cu(II)/substituted phenolate, Cu(II)/axial phenol, and substituted
phenolate/axial phenol with a Cu(I) center in the last case (see
"Computational details"). In all cases, the structures converged to
the same spin-density distribution as described above. Furthermore,
the calculated antiferromagnetic coupling (J ≈ 100 cm−1) is
considerably smaller than the experimental antiferromagnetic
coupling (J > 200 cm−1). In addition, the calculated electronic
transitions from a Kohn-Sham orbital analysis are as follows:
266 nm, substituted phenolate to Cu(II) $3d_{x^2-y^2}$ (π → σ*), and
469 nm, substituted phenolate to phenyl radical (π → π) (Fig. 4c),
which are also in disagreement with the experimental electronic
transitions observed for [Cu(II)-Y•] GO. These results obtained for
the minimal coordination chemistry model of the active site suggest
that the protein backbone between Y495 and H496 and interactions
among amino acid side chains are important in constraining the
positions of Cu(II) ligands.


Model  2 

Model 1 was extended by adding the backbone linkage between Y495 and
H496 and the α-carbons of each residue to take into account steric
constraints from the protein backbone. Optimization of 2 was
performed by freezing all α-carbons at their crystallographic
positions. This resulted in a tetragonal copper center similar to
that in 1, but the axial Y495 and C228 ligands moved away
considerably from the copper center (Cu-O(Y495) distance 3.47 Å, and
Cu-S(C228) distance 3.56 Å; Fig. 5a). Also, the position of the
equatorial H2O is slightly above the equatorial plane and the H2O
forms a hydrogen-bonding interaction with the oxygen atom of the
axial Y495 (O-O distance 2.67 Å). The average bond distances of
1.98 Å (Cu-N distance 2.00 Å and Cu-O distance 1.94 Å) in 2 are in
reasonable agreement with the distances obtained from the EXAFS
study (Cu-N distance 1.97 Å and Cu-O distance 1.96 Å).

Fig. 5 Optimized structure of model 2. a Cu-ligand distances
(angstroms) and ligand-Cu-ligand bond angles (degrees), b
Spin-density plot (isosurface of ±0.003) and fragment spin densities

The calculated electronic structure of 2 suggests slightly more
covalent bonding than in 1, involving a $3d_{x^2-y^2}$ orbital with
0.67 e− α-spin density located on the Cu(II) ion, and the remaining
0.34 e− spin density delocalized onto the σ-bonded equatorial
ligands [Y272 (0.15 e−), H496 (0.06 e−), H581 (0.08 e−), and H2O
(0.05 e−)]. The protein radical (β-spin) is mainly localized onto a
π orbital of axial Y495 (Fig. 5b). The calculated antiferromagnetic
coupling J (approximately 7 cm−1) is almost negligible, yielding
nearly degenerate triplet and singlet states. Hence, the calculated
electronic structure of 2 is inconsistent with the experimental data
in the location of the protein radical, the negligible
singlet-triplet energy gap, and the inaccurate electronic
transitions (Fig. S1).
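
The α-spin partitioning quoted for model 2 can be checked with a few lines of arithmetic. The sketch below simply sums the fragment spin densities given in the text. The dictionary layout and function names are illustrative assumptions, and the values are the rounded numbers reported above rather than raw Mulliken output.

```python
# Fragment alpha-spin densities (in electrons) reported for oxidized model 2.
MODEL2_ALPHA_SPIN = {
    "Cu": 0.67,
    "Y272": 0.15,
    "H496": 0.06,
    "H581": 0.08,
    "H2O": 0.05,
}


def ligand_spin(fragments: dict, metal: str = "Cu") -> float:
    """Spin density delocalized onto all fragments except the metal."""
    return round(sum(v for k, v in fragments.items() if k != metal), 2)


def total_spin(fragments: dict) -> float:
    """Total spin density summed over all listed fragments."""
    return round(sum(fragments.values()), 2)


if __name__ == "__main__":
    print(ligand_spin(MODEL2_ALPHA_SPIN), total_spin(MODEL2_ALPHA_SPIN))
```

The ligand contributions sum to 0.34 e−, as stated, and the total of 1.01 e− is consistent with one unpaired α electron within the rounding of the reported values.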


Model  3 

We reasoned that for changing the location of the Tyr radical from
axial to equatorial it would be desirable to reduce the
nucleophilicity of the axial Tyr (Y495) ligand and thus destabilize
a radical at this position. Accordingly, model 2 was extended via
addition of a hydrogen-bonding water molecule, which is resolved in
the crystal structure [3]. Optimization of 3 with the same
constraints as for 2 resulted in a tetragonal pyramidal copper site
with all equatorial ligands located within 1.89-2.0 Å, the shortest
distance being Cu-O(Y272) (Fig. 6a). The axial Y495 is located at
2.49 Å from the copper center, and the Tyr ring is almost parallel
to the equatorial plane. The electronic structure of 3 shows small
changes in the electron spin distribution despite the geometric
difference relative to 2, and results in a triplet ground state
(Fig. 6b).

Fig. 6 Optimized structure of model 3. a Cu-ligand distances
(angstroms) and ligand-Cu-ligand angles (degrees), b Spin-density
plot (isosurface of ±0.003) and c spin-density plot of Y495
protonated in [Cu(II)-Y•] model 3 (isosurface of ±0.003)

In order to further probe the idea that the reduced nucleophilicity
of axial Y495 could tune the location of the protein radical, we
evaluated the possibility of the O(Y495) being protonated. It is
worth noting that Himo et al. [11] also reported that in the
substrate-bound form of oxidized GO the radical is transferred to
the equatorial Tyr-Cys cross-link simultaneously with the proton
transfer to O(Y495) from the substrate. In our case without
substrate present, a single point energy calculation for the
protonated 3 resulted in a spin distribution that is consistent with
experiment. For the first time, the radical (unpaired β-spin
electron) is located on an out-of-plane π orbital of the Tyr-Cys
cofactor and the unpaired α-spin electron is localized on the Cu
$3d_{x^2-y^2}$ orbital and the in-plane orbitals of the covalently
bound equatorial ligands (Fig. 6c). This result motivated us to
assess the possibilities of other hydrogen-bonding residues around
the oxygen atom of Y495, leading to model 4.

Fig. 7 Optimized structure of model 4a. a Cu-ligand distances
(angstroms, solid lines) and hydrogen-bonding interaction
(angstroms, dotted lines) and ligand-Cu-ligand angles (degrees),
b Spin-density plot (isosurface of ±0.003)


Model  4 

The oxygen atom of the Tyr (Y405) residue is located approximately
3.25 Å away from the O(Y495) and it is well positioned for hydrogen
bonding with Y495. Furthermore, a positively charged Arg residue
(R330) is located within hydrogen-bonding range of H2O (294), which
could stabilize the position of this water molecule. Note that the
position of H2O (294) in model 3 changed considerably by moving
toward Y272 ($d_{\mathrm{O-O(Y272)}}$ = 2.66 Å) and forming a
stronger hydrogen-bonding interaction with Cu-coordinated H2O
($d_{\mathrm{O-O}}$ = 2.58 Å) relative to the crystal structure
($d_{\mathrm{O-O(Y272)}}$ = 3.02 Å and $d_{\mathrm{O-O}}$ = 3.01 Å).
Therefore, model 4 consists of protonated second coordination sphere
residues Y405 and R330 in addition to the residues included in 3
(Fig. 2). Initially, geometric optimization of 4 was performed with
fixed crystallographic positions for the α-carbons of each residue
(4a). As shown in Fig. 7a, model 4a has a tetragonally distorted
pyramidal coordination geometry with H2O in an axial position at
2.49 Å, whereas Y495 now occupies an equatorial position. The
equatorial plane is defined by the four shortest Cu-ligand distances
(1.94-2.06 Å), with cis and trans ligand-Cu-ligand bond angles of
91-105° and 114 and 153°, respectively (Fig. 7a). With respect to
the electronic structure, this means that the equatorial plane
containing the $3d_{x^2-y^2}$ orbital is rotated by 90° relative to
that in models 1-3.

The calculated electronic structure of the oxidized [Cu(II)-Y•]
state of 4 suggests a metal-centered bonding description involving a
$3d_{x^2-y^2}$ orbital with 0.76 e− α-spin density located on the
Cu(II) ion, and the remaining 0.24 e− is delocalized evenly onto the
σ-bonded equatorial ligands (Fig. 7b). The protein radical
(Tyr•-Cys) has approximately 0.94 e− β-spin density in an
out-of-plane π orbital, including approximately 0.20 e− spin
delocalization onto the sulfur atom of C228. The possible
involvement of S orbitals in the protein radical predicted by this
model parallels the conclusions from studies of biomimetic GO models
[14, 40].

An alternative optimization of oxidized 4 was performed with
additional distance constraints from EXAFS [both Cu-N distance
1.97 Å and Cu-O(Y272) distance 1.96 Å] in addition to fixing
α-carbons at their crystallographic positions (4b). Interestingly,
this approach produced a trigonally distorted tetrahedral Cu(II)
geometry with ligand-Cu-ligand angles spanning from 99 to 125°
(Fig. 8a), which is quite different from the situation for 4a, and
resembles the angular parameters of blue copper proteins [37]. The
equatorial H2O in 4b is quite distant from the copper center at
3.42 Å. Both models (4a and 4b) agree with EXAFS results but differ
in their electronic structures. In 4b the Cu(II) spin density was
increased to approximately 0.86 e− α-spin, corresponding to less
covalent bonding (Fig. 8b). The delocalization of the protein
radical (Tyr•-Cys) onto the sulfur atom of C228 was also reduced
by about 4% (Fig. 8b). The calculated antiferromagnetic coupling
constants are 923 and 39 cm−1 for 4a and 4b, respectively. It is
also important to note that model 4a is approximately 68 kJ mol−1
lower in energy than 4b; therefore, model 4a was considered a more
reasonable model than 4b. TD-DFT analysis was performed for 4a and
predicted three intense features at 766, 390, and 372 nm, giving
rather close agreement with the experimental optical transition.
They can be assigned to tyrosinate (π3 → π4), Y272 (π2 → π4), and
Y272 (π3) → σ* of Cu(II) $3d_{x^2-y^2}$ transitions, respectively.

Fig. 8 Optimized structure of model 4b. a Cu-ligand distances
(angstroms, solid lines), hydrogen-bonding interactions (angstroms,
dotted lines), and ligand-Cu-ligand angles (degrees), b Spin-density
plot (isosurface of ±0.003)

Model 5

While 4a can already be considered as a reasonable computational
model for the [Cu(II)-Y•] state of the GO active site, it lacks a
key residue (W290) that has been shown to influence the catalytic
activity by affecting the generation of the tyrosyl radical (Y272)
in GO [41]. In the crystal structure, it also appears that the
conformation of Y272 is affected by a π stacking interaction with
W290. This stacking interaction was included in model 5 to evaluate
the effect of Trp on the geometric and the electronic structure of
the [Cu(II)-Y•] state. The optimized structure of oxidized 5
resulted in a square-planar copper site with all equatorial ligands
located within 1.87-2.04 Å, the shortest distance being Cu-O(Y272).
The axial Y495 is located 3.17 Å from the copper center, and almost
parallel to the equatorial plane of the copper center. Unexpectedly,
the R330 and Y405 residues were distorted significantly in the
optimized structure relative to the crystal structure, in contrast
to the results for 4. W290 was also dislocated from the stacking
plane. Moreover, the electronic structure of 5 is inconsistent with
experiment since the protein radical was mainly localized on a π
orbital of the axial Y495 (Fig. S2).

These results for model 5 indicate that the π-π stacking interaction
of the cross-linked Tyr-Cys cofactor and W290 can have a significant
influence on the electronic structure of the [Cu(II)-Y•] state. We
inferred that the protein environment must modulate the interaction
between W290 and the Tyr-Cys cofactor. Inspection of the crystal
structure revealed that both the axial Y495 and the indole nitrogen
of W290 are partially exposed to solvent. Because our computational
work already revealed that hydrogen bonding can modulate the
electron spin distribution in the GO active site, we introduced an
explicit water solvent molecule that is within hydrogen-bonding
distance of the indole nitrogen of W290 or of the oxygen of Y495 in
separate models. Only the latter hydrogen-bonding interaction
resulted in a protein radical on the cross-linked Y272-C228
cofactor, defining our model 6.


Model  6 

The spin-polarized wave function of 6 was obtained from two
different starting points owing to issues in self-consistent-field
convergence: merging ionic wave functions from individual fragments;
or adding the W290 residue and the water molecule to a covalent,
spin-polarized wave function of 4. The latter resulted in a
lower-energy wave function (atomic spin densities of 0.74 e− on Cu,
−0.60 e− on Y272, −0.15 e− on Y495, and −0.10 e− on W290) by
approximately 104 kJ mol−1 relative to the former wave function
(atomic spin densities of 0.99 e− on Cu, −0.75 e− on Y272, −0.14 e−
on Y495, and −0.09 e− on W290). It is worth noting that these
results clearly demonstrate potential pitfalls in finding the ground
electronic state of large computational models from single-point
calculations. The generalized ionic fragment approach may avoid
these pitfalls by allowing all possible valence-bond structures to
be generated as initial wave functions in a systematic and
reproducible manner. Optimization of 6 starting from the
lower-energy spin-polarized wave function was performed with
additional constraints on R330 (δN, εC, and γN) in order to avoid
the unreasonable distortion of R330 that was observed in 5. The
optimized structure of 6 reveals a five-coordinate square-pyramidal
Cu coordination geometry with Y495 in an axial position and the
other ligands in equatorial positions with cis and trans
ligand-Cu-ligand bond angles of 77-94° and 155 and 159°,
respectively (Fig. 9a). The overall coordination environment of the
copper site in [Cu(II)-Y•] is comparable to that in the crystal
structure of the [Cu(II)-Y] state with some deviations in the
Cu-ligand bond distances. It is interesting to note that the average
Cu-N and Cu-O distances (1.99 and 2.06 Å) also deviate slightly from
the corresponding EXAFS values. The calculated electronic structure
of the oxidized [Cu(II)-Y•] state of 6 suggests a metal-based
bonding description involving a $3d_{x^2-y^2}$ orbital with 0.76 e−
α-spin density located on the Cu(II) ion. The Tyr•-Cys radical has
approximately 0.91 e− β-spin density in an out-of-plane π4 orbital
(Fig. 9b; for orbital labeling see Fig. S3). This includes
approximately 0.24 e− spin delocalization onto the sulfur atom of
C228. The π-π stacking between W290 and Tyr•-Cys (approximately
3.7 Å), and constraints from the α-carbon atoms of Y272, C228, and
W290 maintain a cofactor orientation (Cu-O-C bond angle, α = 125°,
and dihedral angle between the equatorial plane of the
Cu-coordinated ligands and the tyrosyl ring plane, β = 82°) that
results in a diamagnetic ground state [22].

It is important to mention that the proton on the indole N
of W290 is bent and about 0.03 e⁻ spin density is located
on the Trp ring. A Kohn-Sham molecular orbital energy
diagram for the oxidized model is shown in Fig. 9c. The
electronic transitions between HOMO-1 and LUMO orbitals
at 311 and 601 nm in the spin-down (β) and the spin-up (α)
manifolds, respectively, can be assigned to LMCT
and LLCT excitations. The former corresponds to the
tyrosinate (Y495) to Cu 3d(x²-y²) orbital, π4 → σ*, and the
latter to tyrosinate (Y495) to tyrosyl (Y272) radical,
π4 → π4 (Fig. 9c). TD-DFT analysis gave
an improved agreement between the experimental and
calculated band positions and intensities. Nonzero oscillator
strengths (f) were obtained for transitions at 795 nm
(f = 0.02, radical Y272 π3 → π4), 400 nm (f = 0.04, radical
Y272 π2 → π4), and 359 nm [f = 0.05, H496 (π) and
Y272 (π4) to σ* of Cu(II) 3d(x²-y²)]. These are consistent
with assignments based on electronic absorption and resonance
Raman measurements [42, 43] except that the TD-DFT
calculations do not predict substantial contributions
from Y495, as inferred from the resonance Raman experiments.
The calculated antiferromagnetic coupling is about
752 cm⁻¹ for 6.
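
The band positions above are quoted in nanometers, while the coupling constant is quoted in wavenumbers. The following minimal sketch converts the calculated wavelengths to cm⁻¹ and eV so the two can be compared on one scale. The function names and the dictionary are illustrative and not part of the original study; the wavelengths are copied from the text.

```python
# Exact SI defining constants (2019 redefinition).
PLANCK_J_S = 6.62607015e-34
LIGHT_M_S = 2.99792458e8
ELECTRON_VOLT_J = 1.602176634e-19


def wavelength_to_wavenumber(wavelength_nm):
    """Convert a wavelength in nm to a wavenumber in cm^-1."""
    return 1.0e7 / wavelength_nm


def wavelength_to_ev(wavelength_nm):
    """Convert a wavelength in nm to a photon energy in eV (E = h c / lambda)."""
    return PLANCK_J_S * LIGHT_M_S / (wavelength_nm * 1.0e-9) / ELECTRON_VOLT_J


# Calculated transitions quoted in the text (labels are shorthand only).
BANDS_NM = {
    "HOMO-1 -> LUMO (beta)": 311.0,
    "HOMO-1 -> LUMO (alpha)": 601.0,
    "TD-DFT pi3 -> pi4": 795.0,
    "TD-DFT pi2 -> pi4": 400.0,
    "TD-DFT LMCT": 359.0,
}

for label, nm in BANDS_NM.items():
    print(f"{label:24s} {nm:6.0f} nm  "
          f"{wavelength_to_wavenumber(nm):8.0f} cm^-1  "
          f"{wavelength_to_ev(nm):5.2f} eV")
```

On this scale the 795 nm band lies near 12 600 cm⁻¹, more than an order of magnitude above the 752 cm⁻¹ coupling constant.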


Fig. 9 a Optimized structure of model 6 with Cu-ligand (angstroms,
solid lines), hydrogen-bonding (angstroms, dotted lines) distances and
ligand-Cu-ligand angles (degrees). b Spin-density plot (isosurface of
±0.003). c Relative energies of Kohn-Sham molecular orbitals (top)
and HOMO-1 and LUMO (isosurface of ±0.05) of both α and β
manifolds (bottom)


Model  7 

In order to demonstrate that model 6 is indeed a converged
structural model for oxidized GO, which already provides
good agreement with experiment, we further extended this
computational model to 214 atoms. This extended model
contains second and third coordination sphere residues:
Q406, H334, Y329, T580, G513, and additional H2O that
are located within hydrogen-bonding distance of R330,
Y405, R330, H581 and H496, respectively (Fig. 2). The
optimized structure reveals a five-coordinate square pyramidal
geometry highly similar to 6 without significant
distortion of any residues (Fig. 10a) relative to their
crystallographic positions.

Fig. 10 a Optimized structure of [Cu(II)-Y•] model 7 with Cu-ligand
(angstroms, solid lines) and hydrogen-bonding (angstroms,
dotted lines) distances and ligand-Cu-ligand angles (degrees).
b Spin-density plot (isosurface of ±0.003)

The Q406 residue that keeps the guanidinium group of
R330 in place was replaced by defining constraints for δN,
εC, and ηN atoms of R330. Importantly, the calculated
electronic structure for the [Cu(II)-Y•] state is identical to
model 6 as well (Fig. 10b). The antiferromagnetic coupling
constant J was only slightly affected (approximately
608 cm⁻¹) with the same preference of the open-shell
singlet ground state relative to the triplet state. Thus, model
6 provides a sufficient description of the structure of the
[Cu(II)-Y•] state. Model 7 may be needed in future investigation
of the molecular mechanism of the GO chemical
function.
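
For orientation, the link between a coupling constant and the singlet-triplet gap can be written out for two S = 1/2 centers, here the Cu(II) ion and the tyrosyl radical. The Hamiltonian convention used below is an assumption; the text does not state which convention underlies the reported J values.

```latex
\begin{align}
\hat{H} &= -2J\,\hat{S}_A\cdot\hat{S}_B, \qquad S_A = S_B = \tfrac{1}{2} \\
E(S) &= -J\left[S(S+1) - S_A(S_A+1) - S_B(S_B+1)\right] \\
E(S{=}0) &= \tfrac{3}{2}J, \qquad E(S{=}1) = -\tfrac{1}{2}J \\
E(S{=}1) - E(S{=}0) &= -2J
\end{align}
```

With antiferromagnetic coupling (J < 0 in this convention) the singlet lies below the triplet by 2|J|. Under the alternative convention, with the Hamiltonian written as -J S_A·S_B, the gap would be |J|, so the convention must be known before a reported magnitude is converted into an energy gap.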


Redox  dependence  of  the  active-site  structure 

While model 6 has been shown to be the converged computational
model that reasonably reproduces key
experimental data for the [Cu(II)-Y•] state of GO, the
redox dependence of the active-site structure required a
minor, yet important modification to this model. The
change in the total charge of the computational model that
accompanies reduction and the concomitant decrease of
covalent interactions between the Cu center and its
ligands required the introduction of additional geometric
constraints to 6 that prevent unreasonable displacements
of residues relative to their crystallographic positions.
These additional constraints on β- and γ-C atoms of
W290 affect the optimized model of the [Cu(II)-Y•] state
negligibly. The introduction of these constraints is not
arbitrary as they make up for the absence of hydrogen-bonding
interactions from the Y329 residue that is present
explicitly in model 7.

The potential energy surfaces of the three redox states
were rigorously probed as summarized in Fig. 3. In the
following we discuss only the lowest-energy, final
stationary structures obtained for each redox step.

Semireduced  state  of  GO  [Cu(II)-Y] 

Geometry optimizations from different starting points
resulted in three different stationary points that are connected
by a distortion coordinate along the Cu-O(Y495)
and the Cu-OH2 bonds. This distortion coordinate spans a
tetragonal Cu(II) coordination geometry with short Cu-OH2
(2.09 Å) and long Cu-O(Y495) (2.30 Å) distances, a
structure with equivalent Cu-O distances (2.16 Å), and the
lowest-energy semireduced [Cu(II)-Y] structure, which
possesses a long Cu-OH2 (2.24 Å) and a short Cu-O(Y495)
(2.05 Å) bond (Table S1). The energy differences
among these models are within the accuracy of the selected
level of theory (approximately 4.4 kJ mol⁻¹). Going from
the oxidized to the semireduced state, we find the most
significant change is the shorter Cu-O(Y272) distance
(2.02 vs. 1.92 Å) (Figs. 9a, 11). The calculated average
Cu-O and Cu-N distances (1.99-2.05 and 2.00-2.05 Å)
are slightly longer than the reported EXAFS [5] distances
(1.97 and 1.95 Å). Independent of the position of a model
structure along the distortion coordinate, the atomic spin
densities remain similar: about 0.72-0.75 e⁻ on the Cu(II)
ion, with the remaining 0.25 e⁻ distributed onto the
equatorial ligands (Y272, 0.10-0.12 e⁻; H496, H581,
0.04-0.07 e⁻; H2O, 0.02-0.04 e⁻). TD-DFT analysis
revealed that electronic transitions of these three structures
are similar in energy to the experimental absorptions at 468
and 625 nm, which correspond to LMCT [Y272 (π) to σ*
of Cu(II) 3d(x²-y²)] and ligand-field transitions, respectively
(Table S2).

Fig. 11 Optimized structure of [Cu(II)-Y] from [Cu(II)-Y•] 6 with
Cu-ligand (angstroms, solid lines) and hydrogen-bonding (angstroms,
dotted lines) distances and ligand-Cu-ligand angles (degrees)

The semireduced model 7 also parallels the result
obtained from semireduced model 6 (Fig. S4, Table S3);
however, the rotation of W290 is observed even in this
extended model. Additional constraints (β-carbons of Y272
and W290) were applied to prevent this structural change,
which had insignificant influence on the coordination
geometry and the electron spin density distribution.

As a further validation of the semireduced computational
model, we determined the one-electron redox
potential for the [Cu(II)-Y•]/[Cu(II)-Y] couple. The ionization
potential of [Cu(II)-Y•]/[Cu(II)-Y] is 7.88 V and
by increasing the dielectric constant of the polarizable
continuum surrounding model 6, this decreases closer to
the experimental value. The calculated redox potential is
691 mV with a dielectric constant of 10, which is a typical
dielectric constant for a buried redox-active site (Table
S4). However, the GO active site is partially solvent
exposed and thus it is not surprising that a higher dielectric
constant (ε = 40) gives a better agreement (calculated,
439 mV; experimental, 400 mV) [29]. The good agreement
between the experimental and the calculated spectroscopic
features, average Cu-O/Cu-N bond lengths, and redox
potentials suggests that model 6 captures most of the
important structural features and interactions in the protein
environment for both the oxidized and the semireduced
states of GO.
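
The effect of the continuum dielectric constant on the agreement with experiment can be tabulated directly. The sketch below only restates the numbers quoted above for the [Cu(II)-Y•]/[Cu(II)-Y] couple; the variable and function names are illustrative and do not come from the original work.

```python
# Values quoted in the text for the oxidized/semireduced couple (mV).
EXPERIMENTAL_MV = 400.0
CALCULATED_MV = {10: 691.0, 40: 439.0}  # key: dielectric constant epsilon


def deviations(calculated, experimental):
    """Return calculated minus experimental potential for each epsilon."""
    return {eps: value - experimental for eps, value in calculated.items()}


DEVIATIONS_MV = deviations(CALCULATED_MV, EXPERIMENTAL_MV)

for eps in sorted(DEVIATIONS_MV):
    print(f"epsilon = {eps:2d}: calculated {CALCULATED_MV[eps]:.0f} mV, "
          f"deviation {DEVIATIONS_MV[eps]:+.0f} mV")
```

Raising the dielectric constant from 10 to 40 reduces the deviation from 291 mV to 39 mV.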


Reduced  state  of  GO  [Cu(I)-Y] 

Starting from various initial structures for the reduced state
(Fig. 3), we located stationary points along a similar distortion
coordinate as described for the semireduced state.
These range from a Cu(I) site with a trigonal pyramidal
coordination geometry (higher energy) to the almost trigonal
planar structure, where the Cu(I) is coordinated by
two His N atoms and a Tyr O atom. The average Cu-(N/O)
bond distance of 2.01 Å (Fig. 12a, Table S5) is close to the
experimental EXAFS distance of 1.99 Å [6]. The
π stacking of W290 and the Tyr-Cys cofactor is perturbed
as expected from the increased electrostatic repulsion
between the two-electron-reduced active site and the
electron-rich W290. For model 7 only the trigonal planar
structure (Table S6) was localized as a stationary point,
with a W290 rotated away from the π-stacking plane owing
to rotation along the βC-γC bond in the W290 residue.¹

The redox potential calculated using a dielectric constant
of 40 for the [Cu(II)-Y]/[Cu(I)-Y] couple is
approximately 930 mV more negative than the experimental
value of 160 mV [29]. This is a rather large
difference and suggests that the reduced form may be
protonated. Furthermore, Wright and Sykes [29] have
described the pH dependence of the redox potential for the
[Cu(II)-Y]/[Cu(I)-Y] couple. In order to investigate this,
the reduced structure was protonated at O(Y495) and in a
separate model at O(Y272). Both protonated states gave a
three-coordinated Cu site (Fig. 12b, c), as also found by
X-ray absorption spectroscopy [6], with Cu(I) ions coordinated
by two N from His residues and O of tyrosinate,
resulting in trigonal pyramidal structures. The redox


¹ It is important to note that the large number of constraints employed
for the optimized structures for both model 6 and model 7 seems to
have a negligible effect on the coordination geometry, yet rotation of
W290 is observed for the reduced structure owing to the conformational
flexibility of the βC-γC bond of W290. Since W290 is partially
exposed to the solvent, it is quite likely that the distortion observed in
the computational model can occur upon complete reduction of the
GO active site in a protein environment.
Fig. 12 Optimized structures of a [Cu(I)-Y], b Y495-protonated
[Cu(I)-Y], and c Y272-protonated [Cu(I)-Y] with Cu-ligand (angstroms,
solid lines) and hydrogen-bonding (angstroms, dotted lines)
distances and ligand-Cu-ligand angles (degrees)

potentials for the semireduced/Y495-protonated and
semireduced/Y272-protonated reduced couples in low dielectric
medium (ε = 10) were calculated to be -163 and -40 mV,
respectively, which are closer to the experimental value of
160 mV than was the redox potential of the nonprotonated,
reduced form. A tetrasolvated hydronium (H3O+) ion was
used as the source for the protons with an energetic cost of
11.9 eV.

Adjustment of the dielectric constant towards more
polar environments (ε = 40) only slightly affected these
redox potentials (-173 and -54 mV for Y495 and Y272
protonated states, respectively). While these potentials are
somewhat less accurate than that of the oxidized/semireduced
couple, the deviation of 300 mV between the
calculated and the experimental potentials is already a
rather reasonable agreement for a DFT computational
model without the explicit consideration of point charges
and dipoles from the rest of the protein environment.

Furthermore, the redox potentials calculated for a low
dielectric medium (ε = 10) for the catalytically relevant
two-electron redox couples of [Cu(II)-Y•]/[Cu(I)-HY495]
and [Cu(II)-Y•]/[Cu(I)-HY272] are 264 and 325 mV,
respectively. These values are in close vicinity of the
experimental value (approximately 275 mV). In a more
polar environment (ε = 40), these calculated redox potentials
shift towards less positive values (133 and 193 mV)
with a deviation of 80-140 mV from the experimental
values.
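
The two-electron values can be cross-checked against the one-electron potentials quoted earlier. Because free energies of sequential steps add, the potential of a two-electron couple equals the mean of the two one-electron potentials, provided both refer to the same reference electrode. The text does not state this relation explicitly; the sketch below assumes it and shows that the reported numbers are consistent with it to within rounding. All names are illustrative.

```python
def two_electron_potential(first_step_mv, second_step_mv):
    """Mean of two sequential one-electron potentials.

    Follows from dG_total = -F*E1 - F*E2 = -2*F*E_two_electron.
    """
    return (first_step_mv + second_step_mv) / 2.0


# One-electron potentials quoted in the text (mV), keyed by dielectric constant.
OXIDIZED_TO_SEMIREDUCED = {10: 691.0, 40: 439.0}
SEMIREDUCED_TO_PROTONATED = {
    "HY495": {10: -163.0, 40: -173.0},
    "HY272": {10: -40.0, 40: -54.0},
}
# Two-electron potentials reported in the text (mV).
REPORTED = {
    "HY495": {10: 264.0, 40: 133.0},
    "HY272": {10: 325.0, 40: 193.0},
}

COMPUTED = {
    site: {
        eps: two_electron_potential(OXIDIZED_TO_SEMIREDUCED[eps], second)
        for eps, second in by_eps.items()
    }
    for site, by_eps in SEMIREDUCED_TO_PROTONATED.items()
}

for site, by_eps in COMPUTED.items():
    for eps, value in sorted(by_eps.items()):
        print(f"{site}, epsilon = {eps}: mean {value:.1f} mV, "
              f"reported {REPORTED[site][eps]:.0f} mV")
```

The means come out as 264.0 and 325.5 mV for ε = 10 and 133.0 and 192.5 mV for ε = 40, each within 1 mV of the reported value.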


Conclusion 

It has already been well documented experimentally by
site-directed mutations that residues of the second coordination
shell influence substrate binding and catalysis in
GO. Our current computational study parallels these
site-directed studies in emphasizing the importance of the
contributions from second coordination sphere residues.
Furthermore, we would like to put this study forward as an
example for evaluating contributions from the protein
matrix to the active-site structure and catalytic activity.
Inclusion of outer-sphere ligands in the in silico models
was crucial to obtain the spectroscopically correct electronic
structure for the active site of GO. The hydrogen-bonding
interactions involving the O atom of Y495 and the
π-π stacking of W290 and the Tyr•-Cys cofactor play a
critical role in modulating the spin-density distribution
around the Cu site and thus the molecular orbitals involved
in catalysis. Since Y495 and W290 are both exposed to
water solvent, inclusion of hydrogen-bonding interactions
from explicit solvent water molecules with these residues is
also essential. Our oxidized [Cu(II)-Y•] models (the
smallest required model 6 and an optimal model 7) with a
tetragonal square-pyramidal active-site geometry and with
a radical centered on the Tyr-Cys cofactor give good
agreement with the experimentally observed optical transitions
and magnetic coupling constant. For an overview,
the location of the protein radical, magnetic coupling and
electronic transitions from all the computational models are
provided as supplementary material (Table S7).

Using these models, we have developed the structure
of the fully reduced [Cu(I)-Y] state, which is also implicated
in the catalytic mechanism, as well as the resting
inactive, semireduced [Cu(II)-Y] state. The lowest-energy
semireduced and fully reduced states were shown to have
distorted tetragonal and trigonal planar Cu sites, respectively.
We were able to estimate reasonably well the
catalytically relevant redox potentials for the [Cu(II)-Y•]/
[Cu(I)-HY] couple (133-193 mV; experimental, approximately
275 mV) in an intermediate dielectric environment
with ε = 40. The protonation states of axial Y495 and the
equatorial Y272 were found to be critical for obtaining
experimentally sound redox potentials. Similarly to X-ray
absorption spectroscopy studies on the substrate-reduced
form, both protonated states show a three-coordinate copper
site. Overall, model 6 is the smallest converged model
for the GO active site that captures all the essential
elements of the active site, and 7 is sufficiently
extended to validate conclusions based on 6. Our computational
work sets the stage for a detailed investigation of
substrate binding and the catalytic cycle of GO.

Acknowledgements  We  gratefully  acknowledge  the  support  of  this 
research  by  NIH  grant  GM  27659  for  D.M.D.  and  by  the  Montana 
State  University  Thermal  Biology  Institute  (NASA  NNG04GR46G) 
and the ONR Center for Bioinspired Nanomaterials (N00014-06-01-1016)
for R.K.S.


References 

1.  Avigad  G,  Amaral  D,  Asensio  C,  Horecker  BL  (1962)  J  Biol 
Chem  237:2736-2743 

2.  Whittaker  JW  (2003)  Chem  Rev  103:2347-2363 

3.  Ito  N,  Phillips  SEV,  Stevens  C,  Ogel  ZB,  McPherson  MJ,  Keen 
JN,  Yadav  KDS,  Knowles  PF  (1991)  Nature  350:87-90 

4.  Whittaker  MM,  Whittaker  JW  (1988)  J  Biol  Chem  263:6074- 
6080 

5.  Knowles  PF,  Brown  RD,  Koenig  SH,  Wang  S,  Scott  RA, 
McGuirl  MA,  Brown  DE,  Dooley  DM  (1995)  Inorg  Chem 
34:3895-3902 

6.  Clark  K,  Penner-Hahn  JE,  Whittaker  M,  Whittaker  JW  (1994) 
Biochemistry  33:12553-12557 

7.  Whittaker  JW  (2002)  Adv  Protein  Chem  60:1-49 

8.  Whittaker  MM,  Whittaker  JW  (2000)  Protein  Expr  Purif  20:105- 
111 

9.  Wachter  RM,  Branchaud  BP  (1998)  Biochim  Biophys  Acta 
Protein  Struct  Mol  Enzymol  1384:43-54 

10.  Rothlisberger  U,  Carloni  P,  Doclo  K,  Parrinello  M  (2000)  J  Biol 
Inorg  Chem  5:236-250 

11.  Himo  F,  Eriksson  LA,  Maseras  F,  Siegbahn  PEM  (2000)  J  Am
Chem  Soc  122:8031-8036

12.  Babcock  GT,  El-Deeb  MK,  Sandusky  PO,  Whittaker  MM, 
Whittaker  JW  (1992)  J  Am  Chem  Soc  114:3727-3734 

13.  McGlashen  ML,  Eads  DD,  Spiro  TG,  Whittaker  JW  (1995)  J  Phys
Chem  99:4918-4922

14.  Gerfen  GJ,  Bellew  BF,  Griffin  RG,  Singel  DJ,  Ekberg  CA, 
Whittaker  JW  (1996)  J  Phys  Chem  100:16739-16748 


15.  Whittaker  MM,  Chuang  YY,  Whittaker  JW  (1993)  J  Am  Chem 
Soc  115:10029-10035 

16.  Halfen  JA,  Young  VG  Jr,  Tolman  WB  (1996)  Angew  Chem  Int 
Ed  Engl  35:1687-1690 

17.  Halfen  JA,  Jazdzewski  BA,  Mahapatra  S,  Berreau  LM,  Wilkinson
EC,  Que  L  Jr,  Tolman  WB  (1997)  J  Am  Chem  Soc  119:8217-8227

18.  Chaudhuri  P,  Hess  M,  Weyhermueller  T,  Wieghardt  K  (1999) 
Angew  Chem  Int  Ed  Engl  38:1095-1098 

19.  Chaudhuri  P,  Hess  M,  Mueller  J,  Hildenbrand  K,  Bill  E,  Wey¬ 
hermueller  T,  Wieghardt  K  (1999)  J  Am  Chem  Soc  121:9599- 
9610 

20.  Chaudhuri  P,  Hess  M,  Flowers  L,  Wieghardt  K  (1998)  Angew 
Chem  Int  Ed  Engl  37:2217-2220 

21.  Kruse  T,  Weyhermuller  T,  Wieghardt  K  (2002)  Inorg  Chim  Acta 
331:81-89 

22.  Muller  J,  Weyhermuller  T,  Bill  E,  Hildebrandt  P,  Ould-Moussa 
L,  Glaser  T,  Wieghardt  K  (1998)  Angew  Chem  Int  Ed  Engl 
37:616-619 

23.  Szilagyi  RK,  Solomon  EI  (2002)  Curr  Opin  Chem  Biol  6:250-258

24.  Szilagyi  RK,  Metz  M,  Solomon  EI  (2002)  J  Phys  Chem  A
106:2994-3007

25.  Szilagyi  RK,  Winslow  M  (2006)  J  Comput  Chem  27:1384-1397 

26.  Ito  N,  Phillips  SEV,  Yadav  KDS,  Knowles  PF  (1994)  J  Mol  Biol 
238:794-814 

27.  Rogers  MS,  Knowles  PF,  Baron  AJ,  McPherson  MJ,  Dooley  DM 
(1998)  Inorg  Chim  Acta  275-276:175-181 

28.  Rokhsana  D,  Dooley  DM,  Szilagyi  RK  (2006)  J  Am  Chem  Soc 
128:15550-15551 

29.  Wright  C,  Sykes  AG  (2001)  J  Inorg  Biochem  85:237-243 

30.  Frisch  MJ  et  al  (2005)  Gaussian  03,  revision  D  01.  Gaussian, 
Pittsburgh 

31.  Schaefer  A,  Horn  H,  Ahlrichs  R  (1992)  J  Chem  Phys  97:2571-2577

32.  Rassolov  VA,  Pople  JA,  Ratner  MA,  Windus  TL  (1998)  J  Chem 
Phys  1223-1229 

33.  Francl  MM,  Hehre  WJ,  Binkley  JS,  Gordon  MS,  DeFrees  DJ,
Pople  JA  (1982)  J  Chem  Phys  77:3654-3665

34.  Hariharan  PC,  Pople  JA  (1973)  Theor  Chim  Acta  28:213-222 

35.  Ryde  U,  Olsson  MHM,  Pierloot  K  (2001)  Theor  Comput  Chem 
9:1-55 

36.  Farkas  O,  Schlegel  HB  (1998)  J  Chem  Phys  109:7100-7104

37.  Solomon  EI,  Szilagyi  RK,  George  SD,  Basumallick  L  (2004)
Chem  Rev  104:419-458

38.  Stratmann  RE,  Scuseria  GE,  Lovell  T  (1998)  J  Chem  Phys 
109:8218-8224 

39.  Davidson  ER,  Clark  AE  (2005)  Int  J  Quantum  Chem  103:1-9 

40.  Itoh  S,  Taki  M,  Kumei  H,  Takayama  S,  Nagatomo  S,  Kitagawa  T, 
Sakurada  N,  Arakawa  R,  Fukuzumi  S  (2000)  Inorg  Chem 
39:3708-3711 

41.  Rogers  MS,  Tyler  EM,  Akyumani  N,  Kurtis  CR,  Spooner  RK, 
Deacon  SE,  Tamber  S,  Firbank  SJ,  Mahmoud  K,  Knowles  PF, 
Phillips  SE,  McPherson  MJ,  Dooley  DM  (2007)  Biochemistry 
46:4606-4618 

42.  McGlashen  ML,  Eads  DD,  Spiro  TG,  Whittaker  JW  (1995)
J  Phys  Chem  99:4918-4922

43.  Whittaker  MM,  DeVito  VL,  Asher  SA,  Whittaker  JW  (1989) 
J  Biol  Chem  264:7104-7106 




JACS 


ARTICLES 

Published  on  Web  03/07/2008 

Dithiomethylether  as  a  Ligand  in  the  Hydrogenase  H-Cluster 

Arti  S.  Pandey,  Travis  V.  Harris,  Logan  J.  Giles,  John  W.  Peters,*  and 

Robert  K.  Szilagyi* 

Department of Chemistry and Biochemistry and Astrobiology Biogeocatalysis Research Center,
Montana State University, Bozeman, Montana 59717

Received December 17, 2007; E-mail: john.peters@chemistry.montana.edu; szilagyi@montana.edu


Abstract: An X-ray crystallographic refinement of the H-cluster of [FeFe]-hydrogenase from Clostridium
pasteurianum has been carried out to close-to atomic resolution and is the highest resolution
[FeFe]-hydrogenase presented to date. The 1.39 Å, anisotropically refined [FeFe]-hydrogenase structure provides
a basis for examining the outstanding issue of the composition of the unique nonprotein dithiolate ligand
of the H-cluster. In addition to influencing the electronic structure of the H-cluster, the composition of the
ligand has mechanistic implications due to the potential of the bridge-head γ-group participating in proton
transfer during catalysis. In this work, sequential density functional theory optimizations of the dithiolate
ligand embedded in a 3.5-3.9 Å protein environment provide an unbiased approach to examining the
most likely composition of the ligand. Structural, conformational, and energetic considerations indicate a
preference for dithiomethylether as an H-cluster ligand and strongly disfavor the dithiomethylammonium
as a catalytic base for hydrogen production.


Introduction 

The [FeFe]- and [NiFe]-hydrogenases¹,² catalyze the reversible
hydrogen oxidation and proton reduction reactions. Crystal
structures³,⁴ of [FeFe]-hydrogenases reveal a biologically
unprecedented active site (H-cluster) that exists as a [4Fe-4S]
cubane linked to a 2Fe-subcluster via a cysteine thiolate. In
addition, the 2Fe-subcluster is coordinated by terminal carbon
monoxide ligands and a bridging carbon monoxide, terminal
cyanide ligands, and a unique bridging dithiolate ligand.
Although it has been generally accepted that the functional
H-clusters from the structurally characterized [FeFe]-hydrogenases
from Clostridium pasteurianum³ and Desulfovibrio
desulfuricans⁴ are of the same composition, their structures differ
by the coordination environment of the distal Fe atom of the
2Fe-subcluster by the presence of a water molecule. This
difference has been attributed to the crystallization conditions
and the likelihood that the enzymes in the two structures are
poised in different oxidation states.⁵

An  outstanding  issue  of  considerable  interest  concerning  the 
structure  of  the  H-cluster  is  the  chemical  identity  of  the 
nonprotein  dithiolate  ligand.  Because  the  H-cluster  is  a  small 
component  of  the  metalloprotein,  attempts  to  unequivocally 
determine  the  composition  of  its  dithiolate  ligand  have  thus  far 
been  unsuccessful.  However,  this  ligand  has  been  of  significant 
interest  due  to  implications  of  its  composition  on  the  electronic 

(1) Vignais, P. M.; Billoud, B.; Meyer, J. FEMS Microbiol. Rev. 2001, 25 (4),
455-501.

(2) Adams, M. W. W. Biochim. Biophys. Acta 1990, 1020 (2), 115-145.

(3) Peters, J. W.; Lanzilotta, W. N.; Lemon, B. J.; Seefeldt, L. C. Science 1998,
282 (5395), 1853-1858.

(4) Nicolet, Y.; Piras, C.; Legrand, P.; Hatchikian, C. E.; Fontecilla-Camps, J.
C. Struct. Fold Des. 1999, 7 (1), 13-23.

(5) Nicolet, Y.; Lemon, B. J.; Fontecilla-Camps, J. C.; Peters, J. W. Trends
Biochem. Sci. 2000, 25 (3), 138-143.


structure of the H-cluster and the mechanism of dihydrogen
uptake and evolution. In the first reported structure of an
[FeFe]-hydrogenase isolated from C. pasteurianum,³ the non-sulfur
atoms of the dithiolate ligand were described as a covalent
linkage of light atoms. In the characterization of the
[FeFe]-hydrogenase from D. desulfuricans,⁴ this ligand was originally
assigned as propanedithiolate but later revised to
dithiomethylamine.⁶ Direct experimental evidence concerning the
composition of the ligand has not yet been presented, and assignment
of the secondary amine group in the ligand was based on its
ability to serve as a proton donor or acceptor during catalysis.
Regardless of the merits of the attributes of an amine as a
potential catalytic base, it has been shown that the composition
of the ligand has a large impact on the Fe-S bonding of the
2Fe-subcluster, which directly affects the redox active molecular
orbitals of the H-cluster.⁷ Thus, an imperative must be placed
on resolving this interesting outstanding issue.

In this study, we undertook an unbiased investigation in which
we considered the possibility of several dithiolate compositions
with O, CH2, NH, NH2+, and S groups in the γ-position
(bridge-head) of the dithiolate ligand and evaluated their quantum
chemically optimized structures in comparison to an experimental
crystal structure of 1.39 Å resolution. We opted not to
use an interfaced computational and a crystallographic code
(COMQUM-X/Turbomole and CNS, respectively)⁸ due to the
current limitation of our generalized fragment method that
allows us to obtain the correct electronic structure with spin


(6) Nicolet, Y.; de Lacey, A. L.; Vernède, X.; Fernandez, V. M.; Hatchikian,
E. C.; Fontecilla-Camps, J. C. J. Am. Chem. Soc. 2001, 123 (8), 1596-1601.

(7) Schwab, D. E.; Tard, C.; Brecht, E.; Peters, J. W.; Pickett, C. J.; Szilagyi,
R. K. Chem. Commun. 2006, 35, 3696-3698.

(8) Ryde, U. Dalton Trans. 2007, 6, 607-625.
Table 1. Data Collection Parameters

space group          P4₂2₁2
cell parameters      a = b = 110.79 Å, c = 103.57 Å
                     α = β = γ = 90.0°
total observations   622 033
unique reflections   121 060

resolution shell (Å)   redundancy   completeness (%)   avg I/σ(I)   R_symm(a)
46.9-4.8               7.8          98.2               31.5         0.06
4.80-3.40              8.7          99.9               31.1         0.07
3.40-2.77              7.9          100.0              24.7         0.09
2.77-2.40              6.6          99.9               20.2         0.09
2.40-2.15              5.7          99.7               17.2         0.10
2.15-1.96              5.1          99.5               14.4         0.12
1.96-1.81              4.6          98.9               11.5         0.14
1.81-1.70              4.2          97.4               8.6          0.17
1.70-1.60              3.9          93.5               6.2          0.22
1.60-1.52              3.9          89.0               4.5          0.29
1.52-1.45              4.1          84.3               3.1          0.41
1.45-1.39              4.4          81.7               2.5          0.57
Overall                5.1          93.5               12.6         0.09

(a) $R_{\mathrm{symm}}(I) = \sum_{hkl}\sum_{i} \lvert I_i - \langle I \rangle \rvert \,/\, \sum_{hkl}\sum_{i} I_i$
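
Two of the statistics in Table 1 follow from simple arithmetic. The sketch below, with illustrative function names that are not taken from any crystallographic package, computes the overall redundancy from the observation counts in the table and evaluates R_symm, in the standard form given in the table footnote, on a small set of made-up intensities. The toy intensities are hypothetical and serve only to show how the sum is built.

```python
def redundancy(total_observations, unique_reflections):
    """Average number of measurements per unique reflection."""
    return total_observations / unique_reflections


def r_symm(observations):
    """R_symm = sum |I_i - <I>| / sum I_i over all reflections.

    observations maps an (h, k, l) index to the list of intensities
    measured for that reflection and its symmetry equivalents.
    """
    numerator = 0.0
    denominator = 0.0
    for intensities in observations.values():
        mean_intensity = sum(intensities) / len(intensities)
        numerator += sum(abs(i - mean_intensity) for i in intensities)
        denominator += sum(intensities)
    return numerator / denominator


# Counts from Table 1.
OVERALL_REDUNDANCY = redundancy(622033, 121060)
print(f"overall redundancy = {OVERALL_REDUNDANCY:.1f}")

# Hypothetical intensities, for illustration only.
TOY_DATA = {
    (1, 0, 0): [100.0, 110.0, 90.0],
    (1, 1, 0): [50.0, 50.0],
}
print(f"toy R_symm = {r_symm(TOY_DATA):.2f}")
```

The ratio of total to unique observations reproduces the overall redundancy of 5.1 listed in the table.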


polarized, open shell wave function. The latter is required for
obtaining a proper magnetic coupling between the iron centers
of the complete H-cluster. Instead of using structure factors from
the crystallographic refinement, we present here a simple, yet
insightful, new approach for evaluating various compositions
for unknown atoms or groups. The density functional and basis
sets chosen for the given study have already been thoroughly
evaluated in a separate publication.⁹ We present evidence for
dithiomethylether as a bridging ligand of the 2Fe-subcluster.
Furthermore, a conformation analysis of the orientation of the
γ-group of the dithiolate indicates that dithiomethylamine would
be unable to function as a catalytic base during catalysis. The
presence of an ether functional group opens up new mechanistic
possibilities to be explored by further synthetic and computational
investigations.

Experimental  Section 

Crystallographic Refinement. The [FeFe]-hydrogenase in this study
was purified from C. pasteurianum as described previously.¹⁰ Crystals
of the enzyme were obtained by the microcapillary batch diffusion
method¹¹ in a precipitation solution of 25% polyethylene glycol 4000,
0.1 M sodium acetate (pH 4.6), and 0.1 M sodium sulfate as described
previously/  All  manipulations  were  conducted  in  an  anaerobic  chamber 
at room temperature. Crystals belonging to space group P4₂2₁2 were
obtained in 7 days. Data were collected at SSRL on a Q4 detector at a
wavelength of 0.95364 Å, processed with MOSFLM, and scaled with
SCALA.¹² The crystals were slightly sensitive to radiation damage
during the relatively long exposure times (30 s to 1 min) that were
necessary to collect the highest resolution data possible. The final data
set consisted of a composite of data from four crystals and upon merging
resulted in a data set with cumulative R_symm of 0.09 for data up to a
resolution of 1.39 Å (Table 1).
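
The wavelength and resolution limit together fix the largest scattering angle that had to be recorded. As a minimal sketch, Bragg's law (λ = 2d sin θ, first order) gives the corresponding 2θ; the function name is illustrative, and the detector geometry is not considered here.

```python
import math


def bragg_two_theta_deg(d_spacing_angstrom, wavelength_angstrom):
    """Scattering angle 2-theta in degrees from Bragg's law, n = 1."""
    sin_theta = wavelength_angstrom / (2.0 * d_spacing_angstrom)
    if not 0.0 < sin_theta <= 1.0:
        raise ValueError("d-spacing not reachable at this wavelength")
    return 2.0 * math.degrees(math.asin(sin_theta))


WAVELENGTH = 0.95364  # Angstrom, from the text
HIGH_RESOLUTION_LIMIT = 1.39  # Angstrom, from the text

TWO_THETA_MAX = bragg_two_theta_deg(HIGH_RESOLUTION_LIMIT, WAVELENGTH)
print(f"2-theta at {HIGH_RESOLUTION_LIMIT} A = {TWO_THETA_MAX:.1f} degrees")
```

At this wavelength the 1.39 Å reflections appear at a 2θ of roughly 40°.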

Crystals were nearly isomorphous in comparison to the P4₂2₁2
crystals previously obtained with a = b = 110.79 Å, c = 103.57 Å,
α = β = γ = 90°.¹³ For crystallographic refinement, a randomly selected

(9) Szilagyi, R. K.; Winslow, M. A. J. Comput. Chem. 2006, 27 (12), 1385-1397.

(10) Chen, J. S.; Mortenson, L. E. Biochim. Biophys. Acta 1974, 371 (2), 283-298.

(11) Georgiadis, M. M.; Komiya, H.; Chakrabarti, P.; Woo, D.; Kornuc, J. J.;
Rees, D. C. Science 1992, 257 (5077), 1653-1659.

(12) Collaborative Computational Project Acta Crystallogr., Sect. D: Biol.
Crystallogr. 1994, 50 (Pt 5), 760-763.

(13) Peters, J. W.; Lanzilotta, W. N.; Lemon, B. J.; Seefeldt, L. C. Science 1998,
282 (5395), 1853-1858.


Rfree data set (3%) was assigned and the Rfree was monitored through
all stages of the refinement. Refinement was carried out with the
program SHELX14 using conjugate gradient least-squares15 and block
diagonal matrix least-squares methods. Protein coordinates and isotropic
B factors were refined using the conjugate gradient least-squares method
with protein bonds and angles restrained to target values from Engh and
Huber.16 Restraints for the iron-sulfur clusters were generated using
SHELXPRO.14 Anisotropic B factors for all atoms were implemented for
subsequent steps of refinement, resulting in a drop of 3% in Rfree.
Coordinates were fitted to the 2Fo-Fc and Fo-Fc maps in XTALVIEW17
and alternate amino acid side chain conformations were added where
applicable. Hydrogen atoms were added as fixed atoms. Another round
of conjugate gradient least-squares refinement was carried out with the
bond distances for the H-cluster adjusted to those obtained from a DFT
energy minimization that included the protein environment of the cluster
within 3.5-3.9 Å. All reflections were included in the final step of the
refinement, resulting in an Rcryst of 13.7% for all reflections and 11.7%
for reflections >4σ (Table 3). All restraints were released for the
calculation of standard uncertainties in bond lengths and angles of the
clusters using blocked-matrix least-squares refinement. All main chain
and side chain residues were included in the refinement with a limited
number of residues modeled in multiple conformations, and the final
stereochemistry of the model was analyzed using PROCHECK.18
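
The R values quoted in this paragraph follow the conventional crystallographic residual, R = sum(||Fo| - |Fc||) / sum(|Fo|), evaluated over the working reflections (Rcryst) and over the small set held back for cross-validation (Rfree). The following is a minimal sketch of that bookkeeping, not the SHELX implementation; the function names, the random flagging scheme, and the example amplitudes are hypothetical and chosen only for illustration.

```python
import random


def r_factor(f_obs, f_calc):
    """Conventional residual R = sum(| |Fo| - |Fc| |) / sum(|Fo|)."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def flag_free_set(n_reflections, fraction=0.03, seed=0):
    """Return the indices of a randomly chosen test (free) set.

    The 3% default mirrors the fraction quoted in the text; the random
    selection itself is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    n_free = max(1, round(fraction * n_reflections))
    return set(rng.sample(range(n_reflections), n_free))


# Hypothetical amplitudes: calculated values scatter around the observed ones.
rng = random.Random(1)
f_obs = [rng.uniform(10.0, 100.0) for _ in range(1000)]
f_calc = [o * (1.0 + rng.gauss(0.0, 0.1)) for o in f_obs]

free = flag_free_set(len(f_obs))
work = [i for i in range(len(f_obs)) if i not in free]

r_cryst = r_factor([f_obs[i] for i in work], [f_calc[i] for i in work])
r_free = r_factor([f_obs[i] for i in free], [f_calc[i] for i in free])
print(f"Rcryst = {r_cryst:.3f}, Rfree = {r_free:.3f}")
```

Because the free reflections never enter the minimization, a drop in Rfree (such as the 3% reported on introducing anisotropic B factors) is evidence that the added parameters improve the model rather than overfit it.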
Computational Details. An approximately 200-atom virtual chemical
model was constructed from the H-cluster and its 3.5-3.9 Å protein
environment including the covalently bound full cysteine residues
(italicized residues in Table 2). The dithiolate ligand is seated in a
protein cavity that is lined with residues Cys299, Met497, Gly418,
Phe417, Cys503 (Table 2) and capped with the 2Fe-subcluster. To avoid
unreasonably large displacements relative to the close-to atomic-
resolution crystal structure, the dithiolate ligand groups (bridge-head
γ-group, β-methylene, μ-S) were allowed to relax only stepwise. All
computations were carried out using Gaussian03 Rev. D.01.19 We have
selected the gradient-corrected Becke8820 exchange and Perdew8621
correlation functionals and the Stuttgart-Dresden effective core
potentials and valence basis set for all atoms.22 This level of theory has
already been rigorously validated for iron-sulfur clusters in a previous
work9 and has been shown to reproduce iron-sulfur cluster geometry
within 0.06 Å rms error. The [4Fe-4S] cubane of the H-cluster was
treated as an open shell singlet structure in the +2 core charge state
with two antiferromagnetically coupled rhombs of Ms = ±9/2. Using
the 1.39 Å resolution structure of the H-cluster, 24 possible magnetic
coupling schemes have been evaluated by our ionic fragment approach,9
and only the lowest-energy coupling scheme was utilized in this study.
The magnetic coupling schemes have also been evaluated with a more
accurate functional and basis set (termed B5HFP86/BS5+ in ref 9) without
observing any significant difference in the relative energies of magnetic
states. The 2Fe-subcluster was described by a low spin Fe(II)/Fe(I) pair
in the S = 1/2 state. We have considered all possible dithiolate
compositions including the partially and fully protonated forms of the
secondary amine group. For the sake of clarity of the plots and the
discussion, the results obtained for the O, CH2, and NH2+ compositions
are presented in the text and the results for the rest (S and NH in two
different arrangements where the secondary amine group is H-bonding
to the distal water and vice versa) are given as Supporting Information.


(14) Sheldrick, G. M.; Schneider, T. R. Methods Enzymol. 1997, 277, 319-343.

(15) Konnert, J. H.; Hendrickson, W. A. Acta Crystallogr. 1980, A36 (May),
344-350.

(16) Engh, R. A.; Huber, R. Acta Crystallogr. 1991, A47 (4), 392-400.

(17) McRee, D. E. J. Struct. Biol. 1999, 125 (2-3), 156-165.

(18) Laskowski, R. A.; McArthur, M. W.; Moss, D. S.; Thornton, J. M. J. Appl.
Cryst. 1993, 26 (2), 283-291.

(19) Frisch, M. J.; et al. Gaussian 03, Rev. D01; Gaussian, Inc.: Wallingford,
CT, 2006.

(20) Becke, A. D. Phys. Rev. A: Gen. Phys. 1988, 38 (6), 3098-3100.

(21) Perdew, J. P. Phys. Rev. B: Condens. Matter 1986, 33 (12), 8822-8824.

(22) Dolg, M.; Wedig, U.; Stoll, H.; Preuss, H. J. Chem. Phys. 1987, 86 (2),
866-872.

Table 2. Detailed List of Interactions between the Protein and the H-Cluster Considered for the Computational Model

residue(a)         molecular model              contact  distance       interaction
-----------------  ---------------------------  -------  -------------  ------------------------------------------
GLN195             H2NCO                        CO-S     3.33 Å         dipole interaction with S(Cys499)
HOH641             H2O                          O-S      3.00 Å         H-bonding to S(Cys355)
GLY302, TRP303     H2NCO                        N-S      3.15 Å         H-bonding to S(Cys300)
CYS499             SCH2CH(CONH2)NHCHO           N-S      3.29 Å         H-bonding N(Ala498) to S([4Fe-4S])
                                                CO-S     4.04 Å         dipole interaction
THR356, CYS355     SCH2CH(NHCHO)CONHCH2CONH2    N-S      3.14 Å         H-bonding N(Ser357) to S(Cys355)
                                                CO-S     4.04 Å         dipole interaction
CYS300, CYS299(b)  SCH2CH(CONH2)NHCOCH2CH2SH    CO-O     2.94 Å         H-bonding peptide to NH3 (Lys358)
                                                                        possible distal water-S(Cys299) interaction
CYS503(b)          SCH2CH(NHCHO)CONH2                                   bridging cysteine
HOH17(b)           H2O                          O-N      3.11 Å         H-bonding to amide (Cys503)
MET497(b)          H3CSCH3                      steric   within 3.5 Å   bottom cap at the dithiolate ligand
GLY418(b)          HONCHO                       CO-S     3.71 Å         dipole interaction with dithiolate
PHE417(b)          C6H6                         C-O      3.21 Å         aromatic H-bonding to the distal water
LYS358             H3CNH3                       N-N      2.74 Å         H-bonding to distal CN
MET353             H3CSCH3                      S-O      3.15 Å         dipole with bridging CO
ALA320             H3CCH3                       steric   within 3.5 Å   top cap at bridging carbonyl
SER232             H3COH                        O-N      2.92 Å         H-bonding to proximal CN
PRO324             [N(CHO)CH(CONH2)C3H6]        N-N      2.90 Å         H-bonding to distal CN
                                                steric   within 3.5 Å   distal cap to the 2Fe-subcluster

(a) Ligands to H-cluster are italicized. (b) Residues around the dithiolate ligand.


Table 3. Progress of Refinement of the Previously Published Structure of CpI as Model against the 1.39 Å Data with SHELXL(a)

                                         resolution                                   Rcryst %  Rfree %  Rcryst %  Rfree %
stage                                    (Å)         A     H     W    Par    Obs      (all)     (all)    (>4σ)     (>4σ)
---------------------------------------  ----------  ----  ----  ---  -----  ------   --------  -------  --------  -------
rigid body                               2.5-46      4513  0     0    4522   22036    32.9      34.2     33.8      32.3
coordinates, isotropic B-factors         1.45-10     4844  0     331  19377  102188   21.9      25.4     20.0      23.5
addition of waters                       1.45-10     4940  0     427  19761  102188   20.4      24.3     18.5      22.4
anisotropic B-values                     1.45-10     4942  0     428  44477  102188   15.6      21.2     13.9      19.5
rebuilding + water + glycerol molecules  1.45-10     5208  0     632  45646  102188   13.9      19.9     12.3      18.2
include all data + anisotropic           1.39-10     5208  0     632  46871  113858   14.1      20.1     12.1      18.0
refine occupancies of split residues     1.39-10     5240  0     654  47045  113858   14.0      20.1     12.0      18.0
add Hs                                   1.39-10     5251  4300  667  47219  113858   13.2      19.4     11.8      17.8
final refinement                         1.39-10     5251  4300  667  47219  119840   13.7      -        11.7      -

(a) All reflections were included during the final step of refinement. Columns include A - the number of non-hydrogen atoms, H - number of hydrogen
atoms, W - number of water molecules, Par - number of parameters, and Obs - number of observations.


To evaluate the likelihood of the protonation, we carried out
calculations for estimating the free energy of hydrolysis of the secondary
amine bridge-head group of the dithiolate ligand. In addition to
completing the first solvation shell of a solute (S) by hydrogen-bonded
explicit water molecules, all solvation models were embedded in a low
dielectric environment of ε = 10 with solvent radius of 1.385 Å.
Consequently, the structure of the S·nH2O construct was fully
optimized. Free energy corrections to the electronic energy were obtained
from vibrational analysis and polarizable continuum calculations,23
respectively. To validate this approximation, we also calculated the
pKa of a primary amine and methylthiolate at the same level of theory
and employing the same solvation models; the calculated values are in
good agreement with common experimental values (10.7 and 7.5,
respectively).
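
The link between a computed deprotonation free energy and a pKa is the standard thermodynamic relation pKa = ΔG / (RT ln 10). The sketch below illustrates only that conversion; it does not reproduce the thermodynamic cycle or solvation corrections used in the calculations, and the function names and the 298.15 K default temperature are assumptions introduced here.

```python
import math

R_GAS = 8.314462618e-3  # gas constant in kJ/(mol K)


def pka_from_delta_g(delta_g_kj_mol, temperature_k=298.15):
    """Convert a deprotonation free energy (kJ/mol) to a pKa."""
    return delta_g_kj_mol / (R_GAS * temperature_k * math.log(10.0))


def delta_g_from_pka(pka, temperature_k=298.15):
    """Inverse relation: free energy (kJ/mol) corresponding to a pKa."""
    return pka * R_GAS * temperature_k * math.log(10.0)


# One pKa unit corresponds to RT ln 10 of free energy at the chosen temperature.
per_unit = delta_g_from_pka(1.0)
print(f"One pKa unit = {per_unit:.2f} kJ/mol at 298.15 K")

# The experimental reference values quoted in the text, expressed as energies.
for label, pka in (("primary amine", 10.7), ("methylthiolate", 7.5)):
    print(f"{label}: pKa {pka} -> {delta_g_from_pka(pka):.1f} kJ/mol")
```

Since one pKa unit is worth only a few kJ/mol, validating the solvation model against known pKa values, as done here, is a sensitive check on the computed free energies.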

Although a more sophisticated method that takes into account the
experimental structure factors has recently been established for
computational refinement8 of protein crystal structures, we argue that
gradually relaxed, partial geometry optimizations already have the
potential to distinguish between various chemical compositions.
Replacement of an ether group with a methylene or a secondary
amine/ammonium group is expected to induce a considerable perturbation
in the structure of the dithiolate ligand that can be quantified by
comparing relative energies, initial gradients and forces, and the internal
coordinates between the best crystallographic and the quantum chemistry
optimized atomic positions. Naturally, a smaller energy change, smaller
initial gradients and forces, as well as less deviation from the
crystallographic


(23) Tomasi, J.; Mennucci, B.; Cammi, R. Chem. Rev. 2005, 105 (8), 2999-3093.


positions would correspond to a more likely stoichiometry. We
established a background for our approach by repeating the same set
of optimizations as those used for the protein-embedded H-cluster in
this paper for two small molecule mimics, [Fe2(CO)6(pdt)]24 (1)
and [Fe2(CO)6(dtme)]25 (2), where pdt is propanedithiolate and dtme
is dithiomethylether. As a proof of concept, first both complexes 1 and
2 were fully optimized to obtain the most reasonable in vacuo structures
(Figure S1A, top dotted lines and hollow circles for 1, Supporting
Information). Interchanging the bridge-head CH2 and O groups in pdt and
dtme containing complexes with O and CH2, respectively, costs about
5 kJ/mol (stage A). Further optimization of the β-CH2 and the bridge-head
γ-groups together (stage B) with the rest of the atomic positions kept
fixed corresponds to about 31 kJ/mol further energy stabilization relative
to the initial in vacuo optimized structures. Additional optimization
of the bridging thiolate S atoms (stage C) contributes to the relaxation
to a minor extent (2 kJ/mol) due to the geometric similarities of the
Fe2(CO)6S2 moieties (Figure S1B and C, Supporting Information)
in complexes 1 and 2. These results clearly indicate that a change in the
chemical composition of the bridge-head group should result in
significant energy differences that are greater than the error limit of
computations.

Interestingly, the same approach showed preference of the
dithiomethylether composition for complexes 1 and 2 when the experimental


(24) Lyon, E. J.; Georgakaki, I. P.; Reibenspies, J. H.; Darensbourg, M. Y. Angew.
Chem., Int. Ed. 1999, 38 (21), 3178-3180.

(25) Song, L. C.; Yang, Z. Y.; Bian, H. Z.; Hu, Q. M. Organometallics 2004,
23 (13), 3082-3084.

crystal structures were used as starting geometries (Figure S1, bottom
dashed lines and half filled circles, Supporting Information). This seems
to indicate that the dithiomethylether ligand has a less strained geometric
structure to fit between the two iron ions in the Fe2(CO)6 fragment.
However, this simplified evaluation needs to be taken with caution,
because, as mentioned above for the in vacuo optimized structures,
the dominant structural differences between the crystal structures of
complexes 1 and 2 are not intrinsic to the small molecule complex
(Figure S1B and C, Supporting Information), but are due to the
different supramolecular interactions26 that give rise to crystal packing.
To validate this, we carried out gradually relaxed, partial optimizations
using a 225-atom fragment of the actual crystal structure of
1 (coordinates are given as Supporting Information). In this
computational model, the bridge-head group (O or CH2, stage A), then the
bridge-head and β-CH2 groups (stage B), and last the full dithiolate
ligand (stage C) of a central molecule were gradually relaxed (Figure
S1A, middle solid lines and filled circles, Supporting Information). This
central molecule was surrounded by eight identical molecules frozen
in their crystal positions (Figure S1D, Supporting Information). This
large model, which closely resembles the protein environment-embedded
H-cluster in size and complexity, now shows a preference of the CH2
group over the O group in the pdt containing complex of 1 throughout
the optimization. The smaller difference between the crystal packing
embedded models relative to the in vacuo models is due to the weak
intermolecular (dominantly dipole electrostatic) interactions between
a central and its surrounding Fe2(CO)6(SCH2-CH2/O-CH2S) molecules.

It is also important to emphasize that in this study we have used an
approach superior to the popular QM/MM approach by incorporating
the complete 3.5-3.9 Å protein environment (see Table 2) into the
computational model that is treated at the quantum chemical level. We
have already shown for galactose oxidase,27 a bioinorganic active site
that is structurally less complex than the H-cluster, that this is crucial
for reproducing experimental spectroscopic (UV-vis, magnetic coupling,
ground state) and energetic (one and two electron redox potentials)
parameters.

Results  and  Discussion 

Crystallographic Refinement. In the current work, we have
now refined the structure of the [FeFe]-hydrogenase from C.
pasteurianum to 1.39 Å resolution. This improved resolution
permits the inclusion of hydrogen atoms, the assignment of
anisotropic B-factors, and calculation of individual atomic
uncertainties in the range of those observed for small molecule
mimics.28-30 The final model exhibited stereochemistry
comparable to that observed in the previous structure
determined to 1.8 Å resolution, with relative mean standard
deviations from ideality in bond lengths and bond angles of
0.010 Å and 2.11°, respectively. A Ramachandran analysis of
the structure indicated that all residues were found to be in either
most favored (90%) or additionally allowed (10%) regions.18
The overall B-factors for the protein were 17.1 Å2, with main
chain, side chains, iron sulfur clusters, and solvent atoms having
B-factors of 14.0, 20.4, 9.5, and 33.7 Å2, respectively. The
current study is focused on the analysis of the H-cluster
environment, and a more extensive analysis of the high-resolution
[FeFe]-hydrogenase structure is underway.
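
To put the B-factors quoted above on a more intuitive scale, the standard isotropic relation B = 8π²⟨u²⟩ converts a B-factor (in Å2) into a mean-square atomic displacement ⟨u²⟩. The sketch below applies that textbook relation to the group averages reported in this paragraph; the function names are hypothetical, and treating each group average as a single isotropic B value is a simplification.

```python
import math


def mean_square_displacement(b_iso):
    """Mean-square displacement <u^2> (A^2) from an isotropic B-factor (A^2)."""
    return b_iso / (8.0 * math.pi ** 2)


def rms_displacement(b_iso):
    """Root-mean-square displacement (A) from an isotropic B-factor (A^2)."""
    return math.sqrt(mean_square_displacement(b_iso))


# Average B-factors reported in the text.
B_FACTORS = {
    "overall protein": 17.1,
    "main chain": 14.0,
    "side chains": 20.4,
    "iron-sulfur clusters": 9.5,
    "solvent": 33.7,
}

for group, b_value in B_FACTORS.items():
    print(f"{group:22s} B = {b_value:5.1f}  rms u = {rms_displacement(b_value):.2f}")
```

On this scale the iron-sulfur clusters are the most rigidly ordered part of the model and the solvent the least, which is consistent with the ordering of the reported B-factors.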

(26) Lehn, J. M. Science 1993, 260 (5115), 1762-1763.

(27) Rokhsana, D.; Dooley, D. M.; Szilagyi, R. K. J. Am. Chem. Soc. 2006,
128 (49), 15550-15551.

(28) Darensbourg, M. Y.; Lyon, E. J.; Zhao, X.; Georgakaki, I. P. Proc. Natl.
Acad. Sci. U.S.A. 2003, 100 (7), 3683-3688.

(29) Rauchfuss, T. B. Research on Soluble Metal Sulfides. Inorg. Chem. 2004,
43 (1), 14-26.

(30) Tard, C.; Liu, X. M.; Ibrahim, S. K.; Bruschi, M.; De Gioia, L.; Davies, S.
C.; Yang, X.; Wang, L. S.; Sauers, G.; Pickett, C. J. Nature 2005, 433
(7026), 610-613.


Figure 1. Stereoview of the H-cluster superimposed on the 2Fo-Fc electron
density map at a contour level of 1.5σ (A); ORTEP presentation of the 2Fe
cluster (B).


The electron density maps in this work are of excellent
quality, and the majority of H-cluster atoms are resolved to
atomic resolution (Figure 1A). The most characteristic bond
lengths and intramolecular distances summarized in Figure 2
can be compared to small molecule mimics of the 2Fe-
subcluster. The histograms of Fe···Fe (brown), Fe-S (orange),
Fe-CO (red), and Fe-CN (blue) distances of selected 2Fe
cluster mimics with both CO and CN ligands and a bridging
thiolate were obtained from the Cambridge Crystallographic
Database.31 Although the average Fe···Fe distances in the small
molecule mimics are highly similar (2.540 ± 0.030 Å) to those
in the refined H-cluster (2.551 ± 0.007 Å, Figure 2), the
different Fe oxidation states and the lack of the bridging CO
ligand in the former are reflected in the slightly longer average
Fe-S distances (2.288-2.320 ± 0.010 Å) of the H-cluster
relative to the biomimetic compounds (2.278 ± 0.034 Å).
However, the average Fe-CO and Fe-CN distances seem to
agree rather well with those of the small 2Fe clusters (1.768 ±
0.024 and 1.925 ± 0.028 Å, respectively). The near-atomic
resolution structure allows for the differentiation between the
terminal CO and CN- ligands, because the former are generally
located closer to a metal center than the latter (see histograms
in Figure 2). The refined internal coordinates highlighted in
Figure 2 of the [4Fe-4S]-subcluster agree rather well with the
analogous tetrathiolate coordinated [4Fe-4S] clusters (Fe-Fe
2.764 ± 0.100 Å, Fe-S(sulfide) 2.294 ± 0.041 Å, Fe-S(thiolate)
2.271 ± 0.047 Å, see distribution plots in Figure 2).
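
One simple way to judge whether two quoted distances "agree" is to express their difference in units of the combined uncertainty. The sketch below does this for the Fe···Fe distance of the refined H-cluster (2.551 ± 0.007 Å) and the average over the small molecule mimics (2.540 ± 0.030 Å). It assumes that both ± values can be treated as independent standard uncertainties, which is a simplification: the mimic value describes the spread of a database distribution rather than a single measurement error. The function name is hypothetical.

```python
import math


def z_score(value_a, sigma_a, value_b, sigma_b):
    """Difference between two values in units of their combined uncertainty."""
    return abs(value_a - value_b) / math.hypot(sigma_a, sigma_b)


# Fe...Fe distance: refined H-cluster vs. average of small molecule mimics.
z_fe_fe = z_score(2.551, 0.007, 2.540, 0.030)
print(f"Fe...Fe difference = {z_fe_fe:.2f} combined standard uncertainties")
```

A value well below 1, as obtained here, indicates that the two distances cannot be distinguished within the quoted uncertainties.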

Despite the close-to atomic resolution of the current structure,
the chemical composition of the dithiolate ligand of the H-cluster
cannot be unambiguously assigned. Although the proximal (β-)
groups of the dithiolate are rationally assumed to be methylene,
there is still a debate on the composition of the central (γ-)
dithiolate atom or group. Therefore, we carried out in silico
model building and density functional theory calculations to

(31) Allen, F. H. Acta Crystallogr. 2002, B58 (3, Sp. Issue 1), 380-388 (version
5.29; November 2007).

[Figure 2, panel A: selected interatomic distances (Å)]

2Fe-subcluster
Fe1···Fe2     2.551 ± 0.007
Fe2···OH2     2.379 ± 0.038
Fe1-S1        2.288 ± 0.010
Fe1-S2        2.300 ± 0.010
Fe1-Sγ503     2.391 ± 0.008
Fe2-S1        2.320 ± 0.010
Fe2-S2        2.300 ± 0.010
Fe1-C(O)      1.740 ± 0.039
Fe1-C(N)      1.825 ± 0.036
Fe1-C(O)b     1.906 ± 0.035
Fe2-C(O)b     1.991 ± 0.030
Fe2-C(N)      1.929 ± 0.034
Fe2-C(O)      1.765 ± 0.005

[4Fe-4S]-subcluster
Fe-Fe         2.666-2.844 ± 0.006
Fe-S          2.231-2.387 ± 0.009
Fe-Sγ         2.273-2.303 ± 0.008

Figure 2. Selected interatomic distances within the H-cluster and atom labeling (A) adapted for the H-cluster. Histograms of Fe···Fe, Fe-S, Fe-CO,
Fe-CN and Fe···Fe, Fe-S(sulfide), Fe-S(thiolate) distances within a selected set of 2Fe-subcluster mimics (B) and μ3-tetrasulfido-tetrathiolato-tetrairon
clusters (C).


evaluate the energetic and structural implications of various
compositions (CH2, NH, NH2+, O, and S) of the γ-group at the
center of the dithiolate ligand.

As  emphasized  in  the  Computational  Details,  an  accurate 
representation  of  the  protein  environment  as  a  network  of 
supramolecular  interactions  is  essential  for  capturing  the  most 
important  covalent,  electrostatic,  dipole,  and  hydrogen  bonding 
interactions  that  all  contribute  to  the  structure  and  stability  of 
the  H-cluster.  Table  2  summarizes  the  strongest  intermolecular 
interactions  identified  in  the  1.39  A  resolution  structure  between 
the  H-cluster  and  its  protein  environment.  As  a  result  of  this 
mapping,  we  found  considerably  more  interactions  for  the  2Fe- 
subcluster  than  the  4Fe-subcluster  of  the  H-cluster,  which  is 
expected  due  to  the  lack  of  a  covalently  bound  residue  to  the 
2Fe-subcluster.  Notably,  the  greatest  number  of  interactions 
(marked  with  superscript  b  in  Table  2)  involve  the  dithiolate 
ligand.  It  is  important  to  mention  that  the  dithiolate  ligand  in 
the  protein  bound  H-cluster  is  involved  in  a  more  sophisticated 
and  stronger  network  of  intermolecular  interactions  than  the 
small  molecule  mimics  discussed  in  the  Computational  Details 


section (see above). The 4Fe-subcluster has only one H-bonding
interaction to a sulfide and five dipole and/or H-bonding
interactions to the thiolate groups of covalently bound cysteine
residues. These interactions are expected to be influential in
tuning the redox potentials of the 4Fe-subcluster. About six
important interactions surround the diatomic ligands in addition
to residues Met353, Phe417, Ala320, and Pro324 lining the wall
of the protein cavity around the 2Fe-subcluster. The position
of the Cys299 residue is notable, because it is within H-bonding
distance of the distal water and also connects the pool of
water molecules adjacent to the distal Fe site (Fe2) with the
distal water.

Computational Evaluation of Various Dithiolate Compositions.
Figure 3 presents the energy stabilization as first the
γ-, next the β-, and last the μ-sulfur groups of the dithiolate
ligand were gradually optimized while the positions of the rest
of the atoms were kept fixed. In the first set of optimizations
(section I), only the distal water was allowed to move. It was
anticipated that the presence of the distal water could bias the
optimizations due to hydrogen bonding interactions to the

Figure 3. Energy stabilization upon sequential structural optimization of section I (distal water), II (water + γ-group), III (water + γ-group + β-CH2), and
IV (water + γ-group + β-CH2 + μ-S) of the dithiolate ligand of the H-cluster embedded in a 3.5 Å protein environment. (Insets) Optimized cluster structures
with selected intermolecular distances between the distal iron-distal water and bridge-head group-distal water.


γ-group. To test this, we also carried out optimizations in the
absence of the distal water (dotted lines/hollow circles in Figure
3 starting in section II) in addition to optimizing the positions
of the γ-group and the distal water together (not shown), without
reaching a different conclusion regarding the preference for the
γ-group composition. The changes in two characteristic distances
are presented in Figure 4, and their optimized values are
shown in the insets of Figure 3. The optimized coordinates of
computational models are given as Supporting Information. For
the dithiomethylammonium- and dithiomethylether-containing
models, the distances of the distal water oxygen and the γ-group
stay close to the experimental value of 2.60 Å. This distance is
indicative of a strong hydrogen bonding interaction. From the
structural insets in Figure 3, the hydrogen bonding interactions
between the NH2+ or O groups and the distal water are actually
different. In the former, the distal water is a hydrogen bond
acceptor; in the latter, it is a donor. Comparing the water oxygen-
distal iron distances, the water in the dithiomethylether model
remains close to the experimental value (2.33 Å vs 2.38 Å), whereas
in the other cases it moves away considerably from the distal Fe
(CH2: 3.38 Å and NH2+: 3.07 Å). Although the electron density
feature and the thermal ellipsoid (Figure 1B) suggest dynamic
flexibility and anisotropy for this water molecule, the
crystallographic data would not support the assignment of the water
molecule at the longer oxygen-iron distances observed for
models with CH2 and NH2+ groups. The energy change upon
optimization is considerably less for the dithiomethylether ligand
(O: -16 kJ/mol) than for the others (CH2: -92 and NH2+:
-73 kJ/mol) due to the good agreement between the optimized
and experimental structures. Further optimization of the
dithiolate γ- (section II, O: -28, CH2: -150, NH2+: -96 kJ/mol),
β-groups (section III, O: -175, CH2: -321, NH2+: -269 kJ/mol),
and last the thiolate sulfur atoms (section IV, O: -210,
CH2: -349, NH2+: -297 kJ/mol) clearly maintains the
preference for an O linkage relative to CH2 or NH2+ groups at
the γ-position of the dithiolate ligand.
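
The selection criterion used in this paragraph is that the composition requiring the least energy stabilization on relaxation is the one most consistent with the crystal structure. The sketch below tabulates the stabilization energies quoted in the text and applies that criterion section by section. The data structure and function names are introduced here for illustration only and are not part of the original analysis.

```python
# Energy stabilization (kJ/mol) quoted in the text for each optimization section.
STABILIZATION = {
    "I": {"O": -16, "CH2": -92, "NH2+": -73},
    "II": {"O": -28, "CH2": -150, "NH2+": -96},
    "III": {"O": -175, "CH2": -321, "NH2+": -269},
    "IV": {"O": -210, "CH2": -349, "NH2+": -297},
}


def least_relaxed(section_energies):
    """Composition whose structure changes least in energy on optimization."""
    return min(section_energies, key=lambda name: abs(section_energies[name]))


def excess_relative_to(reference, section_energies):
    """Extra stabilization (kJ/mol) each composition needs beyond the reference."""
    ref = abs(section_energies[reference])
    return {
        name: abs(energy) - ref
        for name, energy in section_energies.items()
        if name != reference
    }


for section, energies in STABILIZATION.items():
    print(section, least_relaxed(energies), excess_relative_to("O", energies))
```

In every section the O composition relaxes the least, and the gap to the CH2 and NH2+ compositions stays well above the few kJ/mol differences found for the in vacuo small molecule mimics.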

For the sake of clarity and simplicity, only the results obtained
for the three dithiolate compositions with O, CH2, and NH2+
γ-groups are presented in Figures 3 and 4. Figure S2 (Supporting
Information) provides a more complete analysis of additional
dithiolate γ-group compositions including thioether (S) and
nonprotonated amine (NH) as well as optimization results up
to the entire 6Fe cluster embedded in the protein environment.
The nonprotonated secondary amine group was examined in
two different conformations in which the NH group acts as either
a hydrogen bond donor (NH w/hydrogen bonding; cyan trace)
or acceptor (NH w/o hydrogen bonding; teal trace). From the
energetic and metric differences between the two protonated
forms in Figure S2A-C (Supporting Information), the latter
seems to be more favorable, yet it is less favorable than the
dithiomethylether composition in all respects of the analysis.
Comparing all the computational results, it is evident that the
dithiomethylether composition corresponds to the least energy
stabilization (Figure 2A) and the least deviations in internal
coordinates (Figure S2B-C, Supporting Information) upon
structural optimization relative to the 1.39 Å resolution structure
of the H-cluster. It is also interesting to note that the distal water
position is best reproduced if it is hydrogen bonding to the
bridge-head γ-group. This is only possible if this group is either
O or nonprotonated secondary amine.

It is expected that upon binding of a secondary amine
substituted dithiolate to two positively charged transition metal
ions the heat of hydrolysis of the amine group will decrease
and thus the pKa will shift toward smaller values, suggesting reduced


4538  J.  AM.  CHEM.  SOC.  ■  VOL.  130,  NO.  13,  2008 


Dithiomethylether  as  a  Ligand  in  the  H-Cluster 


ARTICLES 


[Figure 4 plots: distances versus optimization cycles, grouped by the fragments relaxed (γ-group; γ-group + β-CH2; γ-group + β-CH2 + μ-S)]

Figure 4. Interatomic distances as a function of optimization steps between dithiolate γ-group (A), distal iron-site (B), and the distal water (distal-W).


propensity for protonation. Using small molecule mimics of the
2Fe-subcluster (Fe2S2(CO)6(dtma)), we estimated the pKa
value of the dithiomethylamine in the most reduced Fe(I)Fe(I) state
to be about half (5.1) that of a free secondary amine (Me2NH, 10.1)
in a simulated low dielectric environment (ε = 10) with a
complete first aqueous solvation shell (see Computational
Details). These calculations show that protonation of the
secondary amine group in the most reduced Fe(I)Fe(I) state is not
likely.
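
The consequence of the pKa estimates above can be made concrete with the Henderson-Hasselbalch relation, which gives the protonated fraction of a base as 1 / (1 + 10^(pH - pKa)). The sketch below evaluates it for the two pKa values quoted in the text. The choice of pH 7 as a stand-in for physiologically relevant conditions and the function name are assumptions made here, and applying a bulk-solution relation to a buried active site is itself a simplification.

```python
def fraction_protonated(pka, ph):
    """Henderson-Hasselbalch fraction of the conjugate acid (protonated) form."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


ASSUMED_PH = 7.0  # assumption: neutral pH as a proxy for physiological conditions

for label, pka in (("metal-bound dithiomethylamine", 5.1), ("free Me2NH", 10.1)):
    fraction = fraction_protonated(pka, ASSUMED_PH)
    print(f"{label}: pKa {pka} -> {100.0 * fraction:.1f}% protonated at pH {ASSUMED_PH}")
```

Under these assumptions the metal-bound amine is protonated only about 1% of the time, whereas the free amine is almost fully protonated, which is the quantitative content of the statement that protonation is not likely.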

Because a great deal of simplification had to be introduced
in the above pKa calculations, we further evaluated the
conformational stability of the dithiolate ligand for a protonated versus
nonprotonated secondary amine group. Both [FeFe]-hydrogenase
crystal structures show that the γ-group at the center of the
dithiolate ligand is oriented toward the distal iron. The γ-group
is not sterically constrained in the dimetalladithiabicyclohexane
ring system; thus, it is fluxional under ambient conditions, as
has been shown by detailed temperature-dependent NMR
studies.32 Partial structural optimizations of an alternate
conformation for the γ-group in sections II, III, and IV of Figure
3 indicate that the conformation observed crystallographically
is the most stable for all bridge-head group compositions except
NH2+. In contrast, a 10-14 kJ/mol more-favored conformation
of the dithiomethylammonium ligand exists with the secondary
amine group pointing toward the thiolate S of Cys503 (Figure
5). The relative energies of these conformers remained the same
when the model structures were fully optimized (Figure S2A:
syn, dashed blue line; anti, solid blue line, Supporting
Information). The energetically favored alternative conformation for
the dithiomethylammonium ligand can be considered as a
nonproductive catalytic base because it would shut down the proton
shuttle to the distal iron site. Furthermore, it also competes with
the proximal iron site of the 2Fe-subcluster for the formally
negatively charged bridging cysteine sulfur group, which can
compromise the structural integrity of the H-cluster framework.
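
To gauge what a 10-14 kJ/mol preference for the alternate conformation means in practice, the energy gap can be converted into equilibrium populations with a two-state Boltzmann model. The sketch below is illustrative only: it treats the computed energy difference as a free energy difference and assumes a temperature of 298.15 K, neither of which is stated in the text, and the function name is hypothetical.

```python
import math

R_GAS = 8.314462618e-3  # gas constant in kJ/(mol K)


def minor_population(delta_e_kj_mol, temperature_k=298.15):
    """Two-state Boltzmann population of the higher-energy conformer."""
    return 1.0 / (1.0 + math.exp(delta_e_kj_mol / (R_GAS * temperature_k)))


# Energy gap range quoted in the text for the dithiomethylammonium ligand.
for gap in (10.0, 14.0):
    share = minor_population(gap)
    print(f"gap {gap:4.1f} kJ/mol -> {100.0 * share:.2f}% in the less stable conformer")
```

Within this simple model, no more than about 2% of a protonated ligand would sit in the crystallographically observed, catalytically productive orientation, which is the basis for calling the favored alternative a nonproductive catalytic base.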

Conclusions 

The  close-to  atomic  resolution  crystal  structure  of  the  [FeFe]- 
hydrogenase  from  C.  pasteurianum  provided  us  a  unique 


(32) Lyon, E. J.; Georgakaki, I. P.; Reibenspies, J. H.; Darensbourg, M. Y. J.
Am. Chem. Soc. 2001, 123 (14), 3268-3278.

Figure 5. Anti (NH2 points away from the 4Fe-subcluster) and syn (NH2
points toward the 4Fe-subcluster) conformation of the dithiomethylammonium
ligand.

opportunity for critically evaluating the composition of the
dithiolate ligand by an unbiased systematic in silico analysis.
Computations carried out for a more than 200 atom model of
the H-cluster and its immediate 3.5-3.9 Å protein environment
favor the dithiomethylether composition. Our estimated pKa
value of the Fe(I)Fe(I)-bound dithiomethylamine suggests that the
amine group is not likely protonated under physiologically relevant
conditions. In addition, conformational analysis of the dithiolate
ligand disfavors the dithiomethylammonium as the catalytic base
for hydrogen production due to a preferred alternate conformation
for the secondary ammonium group. This conformation has
not been observed even in the lower resolution crystal structures.
In this alternate conformation, the positively charged ammonium
group hydrogen bonds to the sulfur atom of the bridging cysteine
and thus effectively shuts down the catalytically important
electron delocalization between the 4Fe- and the 2Fe-subclusters
of the H-cluster. Oxygen as the central atom has not yet been
critically exploited by synthetic and computational biomimetic
chemistry; however, our results provide support for this as
the next generation of targets of H-cluster models.